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Introduction 


BASILIO CALDERONE! 
WOLFGANG U. DRESSLER? 


Language sounds are realized in several different ways. Every lan- 
guage exploits no more than a subset of the sounds that the vocal tract 
can produce, as well as a reduced number of their possible combinations. 
The restrictions and the phonemic combinations allowed in the language 
define a branch of phonology called phonotactics. 

Phonotactics refers to the sequential arrangement of phonemic seg- 
ments in morphemes, syllables, and words (Harris 1955) and under- 
lies a wide range of phonological issues, from acceptability judgments 
(pseudowords like legal <blick> vs. illegal <bnik> in English or legal 
<Pfraus> vs. illegal <Xraus> in German) to syllable processes (the syl- 
labic structure in a given language is based on phonotactic permissions 
in that language) and the nature and length of possible consonant clusters 
(which may be seen as intrinsically marked structures with respect to the 
preferred CV template). This volume deals only with consonant clusters. 

The study of phonotactics entails a set of problematic aspects due to 
its nature. In fact, if, on the one hand, phonotactics is part of the phono- 
logical grammar of the language and appears as a rules-based system, on 
the other, it is controlled by a number of non-categorical, probabilistic 
and gradient constraints. Often the researcher is faced with a series of 
apparent contradictions and empirical problems that require critical com- 
parisons of alternative explanatory models and, most often, an investiga- 
tion of the ‘interfaces’ and ‘intersections’ between phonotactics and other 
levels of linguistic organization, particularly phonetics or, instead, only 
phonology and morphology. 

However, this volume focuses on experimental, acquisitional and cor- 
pus linguistic aspects of morphonotactics, which represents an intersec- 
tion area between phonotactics and the morphemic structure of the lan- 
guage. 

In particular, morphonotactics deals with the interplay between the 
ordering restrictions of morphemes (the so-called morphotactics) and the 
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phonemic sequences of consonant clusters (phonotactics). More specifi- 
cally, a consonant cluster 1s phonotactic in the strict sense when it occurs 
within a morpheme (such as /kt/ in German nackt ‘naked’ or Akt ‘act’, or 
in English <act> or <detect>, or as /nd/ in German Kind ‘child’ or Rand 
‘edge’, or in English <kind> or <sound>). A consonant cluster counts 
as morphonotactic when it results from a morphological operation such 
as concatenation (such as /kt/ in German zuck-te ‘jerked’, tank-te ‘re- 
fuelled’, and in English <kick-ed>, <thank-ed>). It is important to note 
that apart from clusters that are purely phonotactic (such as final /mp/ in 
English <limp>, or final /mpf/ as in German Dampf ‘steam’) or purely 
morphonotactic (such as final /md/ in English <seem-ed>; or final /nkst/ 
in German /enk-st (steer-2SG) ‘you steer’), many clusters can occur both 
phonotactically and morphonotactically (e.g. the cluster /kt/ in German 
as in Akt ‘act’ vs. zuck-te ‘jerked’, or /nd/ in English as in <kind> vs. 
<sign-ed>). A much less frequent interaction between phonotactics and 
morphotactics takes place when phonological deletion produces a conso- 
nant cluster in inflection or word formation (as in German Risiko ‘risk’, 
Risk-en (risk-PL) ‘risks’, risk-ant (risk-ADJ) ‘risky’ ). 

A main focus of several of the articles collected in this volume is the 
Strong Morphonotactic Hypothesis (SMH), as proposed by Dressler and 
Dziubalska-Kotaczyk (2006). The SMH claims that morphonotactic con- 
sonant clusters are favoured in processing and acquisition compared to 
phonotactic clusters. This implies a synergy of morphology and phonol- 
ogy, with the acquisitionist effect that morphonotactic consonant clus- 
ters are acquired earlier than corresponding phonotactic clusters. This 
was first interpreted as an earlier emergence of morphonotactic clusters 
(i.e. when a consonant cluster is first produced correctly), but Keli¢ and 
Dressler (2019) have shown for Croatian that it rather holds for mastery 
of consonant clusters (i.e. when a cluster continues to be produced cor- 
rectly). As to processing, the SMH claims that morphonotactic clusters 
are processed more accurately and rapidly than corresponding phonotac- 
tic clusters. These claims have been restricted to languages or morpho- 
logical components with a rich morphology, such as inflection in Slavic 
languages, whereas they may not hold for languages and morphological 
components with less morphological wealth, such as inflection in Ger- 
manic languages. See, for more on this, the chapter by Sommer-Lolei et 
al. in this volume. 

The articles represent the result of a joint French-Austrian interdis- 
ciplinary project with the title ‘Human Behaviour and Machine Simula- 
tion in the Processing of (Mor)Phonotactics’ funded by the ANR (Agence 
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Nationale de la Recherche, ANR-13-ISH2-0002) and the FWF Austrian 
Science Fund (Fonds zur Férderung der wissenschaftlichen Forschung, I- 
1394-G23). The main focus of investigation of the project was the study of 
the psycho-computational representation of (mor)phonotactics in French 
and German speakers from two angles simultaneously: human behaviour 
and machine simulation. Both of them cover a broad range of activities: 
from computational simulations (computational models appealing solely 
to distributional information for the linguistic data and processing the sta- 
tistical regularities of representative corpora) to longitudinal studies of 
acquisition (in order to test whether there are systematic differences with 
regard to the phases at which phonotactic and morphonotactic clusters are 
acquired), psycholinguistic analyses (aiming at verifying the psychologi- 
cal plausibility of hypotheses on the phonotactics/morphonotactics distinc- 
tion) and production analysis (focusing on the phonetic repair mechanisms 
and the systematic differences in the production of morphonotactic and 
phonotactic clusters in actual speech data). The present volume contains 
five papers focusing on the acquisition, speech production, processing and 
corpus linguistic analysis of morphonotactic vs. phonotactic clusters. 

The papers combine distributional analysis and experimental investi- 
gations based on large corpora or on the analysis of the speakers’ behav- 
iour in producing phonotactically marked structures such as consonant 
clusters. 

The first paper of the collection ‘German phonotactic vs. morpho- 
notactic obstruent clusters: a corpus linguistic analysis’ by Wolfgang 
U. Dressler and Alona Kononenko-Szoszkiewicz presents a corpus- 
based study of the obstruent clusters in German. In particular, the pa- 
per investigates the distribution, in terms of type and token frequency, 
of triple consonant clusters (excluding glides) containing two obstruents. 
The study is framed within the NAD (Net Auditory Distance) model, a 
net reflection of the difference between adjacent segments in terms of the 
manner and place of articulation (Dziubalska-Kolaczyk 2002). One main 
result discussed by the authors is that, according to NAD predictions, (at 
least triple) morphonotactic clusters are preferred over phonotactic clus- 
ters for German word-final position, which supports the Strong Morpho- 
notactic Hypothesis (SMH, as described above). This must be compared 
with psycholinguistic evidence, as reported in the chapter by Sommer- 
Lolei et al. (below). The typological characterization of the German lan- 
guage with regard to the word-final and word-initial obstruent clusters, in 
contrast to Slavic and other Indo-European languages, is also discussed 
at the end of the paper. 
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The paper ‘Morphonotactics in speech production’ by Hannah Ley- 
kum and Sylvia Moosmiiller’ investigates the influence of morphology 
on the phonetic realization of utterances. The authors perform acoustic 
analyses of word-medial and word-final consonant clusters, which could 
occur both within a morpheme as well as across morpheme boundaries. 
The hypothesis underlying the study is that consonant clusters across 
word-internal morpheme boundaries (morphonotactic clusters) are ex- 
pected to be more robust and more highlighted in speech production than 
consonant clusters within a morpheme (phonotactic clusters). The anal- 
yses are conducted in three different language types: a word language 
(Standard German German, SGG), a mixed-type language (Standard 
Austrian German, SAG) and a quantifying language (Standard French, 
FR). These three types were chosen to investigate whether language- 
type-specific timing characteristics have an influence on the highlight- 
ing/reduction of consonant clusters. Concerning the language type, the 
authors hypothesize that differences between phonotactic and morphono- 
tactic clusters are more pronounced in SGG as compared to SAG, and the 
differences are expected to be greater than those in FR for both varieties 
of German. The results of the analyses fail to confirm the main hypothesis 
and reveal that there is no difference in respect of durational and inten- 
sity characteristics between clusters with and those without a morpheme 
boundary. However, as the authors state, the absence of any effects does 
not necessarily imply that no direct influence of morpheme boundaries on 
the realization of consonant clusters exists, thus overriding an impact on 
phonology. Besides language-specific timing characteristics, other lan- 
guage-specific differences could exist. The three investigated languages 
share a low morphological richness, raising the question of whether the 
morphological richness of a language determines whether phonotactic 
and morphonotactic clusters behave the same or not. It is possible that in 
morphologically richer languages, the information about the morpheme 
boundary is more important to ensure intelligibility. 

The paper ‘The acquisition and processing of (mor)phonotactic 
consonant clusters in German’ by Sabine Sommer-Lolei, Katharina 
Korecky-Kroll, Markus Christiner and Wolfgang U. Dressler pres- 
ents a set of psycholinguistic experiments testing the Strong Morphono- 
tactic Hypothesis (SMH) which claims that morphonotactic consonant 
clusters foster processing and acquisition. The two psycholinguistic pro- 
cessing methods used are progressive demasking and lexical decision. 
The results partially confirm the SMH, showing a significant positive ef- 
fect only for rich compounding, a partial trend for less rich derivational 
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morphology and no effect for inflection, which is relatively poor in Ger- 
man and thus cannot facilitate lexical processing. A psycholinguistically 
important finding is that familiarity often has a greater facilitating effect 
than frequency. The acquisition part of the chapter presents longitudinal 
data up to 3 years (3;0) of age and quasi-longitudinal transversal data up 
to a mean age of 4;8. Since the data does not suffice for making separate 
statistical analyses for inflection, derivation and compounding, no facili- 
tating effects have been found in previously reported analyses. This con- 
trasts with the facilitation effects found in morphology-rich Lithuanian, 
Polish and Croatian inflectional and derivational morphology. 

The paper ‘Exploring phonotactic and morphonotactic constraints in 
the acquisition of consonant clusters in L1 French’ by Barbara Köpke, 
Olivier Nocaudie and Hélène Giraudo focuses on the possible effects 
of age, position and phonotactic vs. morphonotactic status in the success- 
ful pronunciation of the different French consonant clusters. The authors 
analyse distributionally longitudinal CHILDES data from four children 
(aged 1;6 to 3;0) collected in spontaneous speech interactions between a 
parent and the child. The analysis shows a high variation of error types 
(such as reduction, substitution, omission, repetition, epenthesis, shifted 
cluster or mixed sounds) in the characterization of consonant clusters. A 
more detailed exploration of the individual developmental trajectories, 
however, demonstrates the presence of an overall developmental pattern 
with the number of omissions decreasing while the number of reductions 
increases within the age groups. Concerning the consonant cluster’s posi- 
tion in the word, overall French children have a tendency to a left-side 
preference in the development of the pronunciation of clusters. Finally, 
also, the morphonotactic status of the cluster seems to have a significant 
effect on the development of pronunciation, although only in a medial 
position. According to the authors, this positive effect of the morphono- 
tactic status should be pondered in relation to several factors inherent to 
the corpus which may modulate and affect the results. In particular, mor- 
phonotactic clusters are relatively scarce in French and they never appear 
in the word-initial position, in contrast to a medial position and especially 
the word-final position, which seems a less favourable position in early 
acquisition. This and other considerations led the authors to the conclu- 
sion that an extension of the study to later developmental stages in older 
children, with a consistent vocabulary between the age groups, is needed, 
in order to weigh in detail the influence of frequency and position effects 
in the error patterns related to the (mor)phonotactic status of consonant 
clusters. 
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The last paper ‘The natural perceptual salience of affixes is not incom- 
patible with a central view of morphological processing’, by Héléne Gi- 
raudo, Karla Orihuela, Basilio Calderone and Barbara Köpke reports 
on a set of behavioural experiments testing the reactions of French adults 
in a letter search task. The authors discuss the issue of morphonotactic 
processing through the notion of morphological salience — the functional 
and perceptual relative prominence of the whole word and its morpho- 
logical components — and its implications for theories and models of mor- 
phological processing. With regard to the SMH, the task was carried out 
using words that include the target letter after a morphonotactic boundary 
(e.g. vivre ‘to live’ which contains viv- as a morphological base (stem) and 
re- as a Suffix, a marker of an infinitive) against those with a purely phono- 
tactic one (e.g. centre ‘centre’ in which -re is not a suffix and cent- is not 
a stem). The main hypothesis is that morphonotactic segmentation should 
be facilitated due to a double salience conveyed in the boundaries, as it 
is not only phonological but also morphological. The effects of position, 
initial vs. final, are also explored. The final results show that prototypical 
morphonotactic sequences are processed faster than phonotactic sequenc- 
es in a final position, suggesting that phonotactics helps to decompose 
words into morphemes by enhancing their morphological salience. 

Taken together, the papers offer an interdisciplinary view of (mor)pho- 
notactics, as they provide acoustic-phonetic, psycholinguistic and corpus- 
based evidence in support of the proposed theoretical claims about the 
nature of phonotactic and morphonotactic structures. 

Another merit of the present volume is its crosslinguistic methodol- 
ogy, including two phonologically and morphologically relatively distant 
languages such as French and German. 

Comparing phonotactically very different Germanic and Romance 
languages in the analysis provides a larger and more informative picture 
of (mor)phonotactics in these two languages. 

Behavioural analyses are of particular relevance for the development 
of crosslinguistically valid generalizations on (mor)phonotactic process- 
ing. Psycholinguistic tests applied to the two languages may help to define 
a continuum of phonotactic and morphonotactic complexity, with respect 
to which the two languages will occupy partly different and partly over- 
lapping positions. Similarly, the crosslinguistic differences which emerge 
from the analysis of speech production contribute to the definition of the 
continuum of (mor)phonotactic complexity. 

We hope that this special issue will provide inspiring suggestions for 
further investigations, including interdisciplinary approaches, within the 
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domain of the acquisitional, cognitive and physical aspects of sound or- 
ganization in languages, thus contributing to our knowledge of how hu- 
man speech structures are acquired, mentally organized and physically 
produced. 

The volume is dedicated to the memory of our colleague Sylvia Moos- 
miller (1954—2018) who died shortly after finishing her part of the joint 
contribution with Hannah Leykum. 
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I. German phonotactic vs. morphonotactic 
obstruent clusters: a corpus linguistic analysis 


WOLFGANG U. DRESSLER!?-? 
ALONA KONONENKO-SZOSZKIEWICZ! 


1. INTRODUCTION 


1.1. AIMS 


In this contribution we provide for the first time a typological charac- 
terology (in the sense of Mathesius 1928; Lang & Zifonun 1996) of the 
morphonotactics vs. phonotactics of a single language, compared to con- 
trastive studies such as Dressler et al. (2015) on German vs. Slovak and 
Zydorowicz et al. (2016) on Polish vs. English. We focus on word-initial 
and word-final positions (cf. section 4) and on triple consonant clusters 
(excluding glides) containing two obstruents, because these are more 
typical for German than for many other languages. We approach them in 
terms of an interaction between Natural Phonology and Natural Morphol- 
ogy and the Beats-and-Binding phonotactics of Dziubalska-Kotaczyk 
(2009). We limit our investigation to standard vocabulary and exclude 
onomastics, because it contains clusters that do not occur in standard vo- 
cabulary, such as gm- in many place names (Gmünd, Gmunden etc.). 

With regard to phonological typology, German, like other German- 
ic languages, is a rather consonantal language in respect of the relative 
amount of its consonantal inventory and its variety and complexity of 
consonant clusters (cf. Maddieson 2006, 2013; Donohue et al. 2013), al- 
though — in contrast to several Slavic languages, for example — German 
has syllabic sonorants only in an unstressed position in casual speech. 
German has several voiceless affricates, among the typologically rather 
rare ones the labial-labiodental /pf/ (Luschiitzky 1992). German is richer 
in consonant clusters word-finally than word-initially, in contrast to most 
Romance and many other non-Germanic Indo-European languages. Pho- 
nological typology, though discussed at least since Trubetzkoy (1939), 
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has focused on the characteristics of phonemes, phoneme oppositions and 
phoneme inventories. If phonotactics has been treated at all, then it is 
in terms of syllable structures. Even the recent publications of Hyman 
(2007), Blevins (2007) and Hyman and Plank (2018) mention consonant 
clusters at most in passing and never discuss triple or quadruple clusters 
(for contrastive studies of German, see section 1.6). This lacuna may be 
due to phonological typologists not working with large electronic cor- 
pora, which we do for German in this contribution. 

In continuation of previous theoretical and contrastive work (Dressler 
& Dziubalska-Kotaczyk 2006; Dressler, Dziubalska-Kolaczyk & Pestal 
2010; Korecky-Kroll et al. 2014) we are going to characterize German 
patterns of consonantal morphonotactics vs. phonotactics from a phono- 
logical, morphological, typological and corpus linguistic perspective. 

We investigate prototypical rather than non-prototypical cases of mor- 
phonotactics, i.e. the prototypical case of merely concatenative shapes of 
morpheme combinations, particularly when they differ from the phono- 
tactics of lexical roots and morphemes and thus signal morpheme bound- 
aries, as in English seem-ed /si:m-d/ (i.e. there is no lexical final [-md] 
cluster in English). The non-prototypical case of morphological combi- 
nations resulting in vowel deletion is marginal in German, e.g. in Risiko 
‘risk’, adj. risk-ant ‘risky’ (in contrast to the regular case of schwa dele- 
tion, more in section 4). 


1.2. PHONOTACTICS VS. MORPHONOTACTICS 


Morphonotactic clusters differ from phonotactic ones through the 
interaction of morphotactics with phonotactics (Dressler & Dziubalska- 
Kolaczyk 2006; Calderone, Celata & Laks 2014; Zydorowicz et al. 2016). 
More specifically, morphonotactic clusters are either due to the addition 
of a further morpheme, an affix in the case of derivational morphology or 
another lexical morpheme in the case of compounding, or due to a sub- 
tractive morphotactic operation which leads to vowel deletion, as in Ger. 
silbr-ig ‘silvery’ from Silber ‘silver’ (more in section 4.2). 

Because of this interaction between morphology and phonology, it 
has been claimed (Dressler & Dziubalska-Kotaczyk 2006: 19-20) that in 
general morphonotactic clusters are less preferred than phonotactic ones. 
This contrasts with the Strong Morphonotactic Hypothesis (Dressler & 
Dziubalska-Kotaczyk 2006; Dressler et al. 2010), which states that in pro- 
cessing and first language acquisition the interaction of morphology with 
phonotactics facilitates both processing and acquisition. A further claim 
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on the interaction between morphology and phonology has been made 
by Shosted (2006), who has found a (statistically insignificant) trend of 
a positive correlation between complexity in the syllable structure and 
morphological complexity. It would be worth separating phonological 
and morphonotactic clusters, because only complex morphonotactics 
should correlate with morphological complexity. 

In order to define the level of deviation of morphonotactic (i.e. mor- 
phologically and phonologically motivated) consonant clusters from 
purely phonotactic (i.e. merely phonologically motivated) ones in 
German, we have applied the gradual scale proposed by Dressler and 
Dziubalska-Kołaczyk (2006). These are clusters such as the following 
English ones: 

1) Clusters which are always morphologically motivated, i.e. never 
occur in monomorphemic words (cf. Dressler 1985: 220 f.). To this group 
belongs a consonant cluster /-md/ which always occurs in past participles 
due to concatenation of a sonorant with the suffix, as in seem-ed, claim- 
ed. Other examples of this group are the word-final consonant clusters 
/-fs, -vz/ as in laughs, loves, wifes, wives, which occur only in plurals, 
third person singular present forms and in Saxon genitives. 

2) Clusters, which are morphologically motivated as a strong default, 
i.e. which are paralleled by very few exceptions of a morphologically un- 
motivated nature. For instance, the cluster /ts/ in most cases occurs across 
word boundaries, as in lets, meets, but also in morphologically simple 
words as in quartz, hertz. Moreover, in English a strong default is present 
in a cluster /-ps/ as in steps, keeps, except the borrowings from Latin such 
as apse, lapse, and glimpse. 

3) Clusters, which are morphologically motivated as a weak default, 
i.e. which are paralleled by more exceptions of a morphologically unmo- 
tivated nature. An example is the consonant cluster /-ks/, which is always 
morphonotactic in the third person singular verb endings and in plurals 
as in speaks, oaks, and a phonotactic cluster related to the spelling <x> 
as in fox, mix. 

4) Clusters, whose minority is morphologically motivated, i.e. which 
are quite normal phonotactic clusters and may also have some morpho- 
logical motivation. To this group belongs the cluster /-nd/ that occurs 
across morpheme boundaries in past-tense verbs or past participles as 
in grinned, tanned. Moreover, as a phonotactic cluster, it is present in a 
number of words such as hand, land, around. 

5) Clusters which are only phonotactic, thus never divided by a mor- 
pheme boundary, such as /rf, sk/, as in turf, ask. 
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The theoretical background of our contribution is Natural Phonol- 
ogy and Morphology (cf. Dressler 1984; Dziubalska-Kotaczyk & Weck- 
werth 2002; Dziubalska-Kolaczyk 2009; Kilani-Schoch & Dressler 2005; 
Dressler & Kilani-Schoch 2017), as well as morphonology (Dressler 1985, 
1996a,b), of which morphonotactics is a part (Dressler & Dziubalska- 
Kolaczyk 2006). This approach not only strives towards descriptive und 
explanatory adequacy but also towards guaranteeing, at least partially, the 
psychological reality of the linguistic constructs. This demands a psycho- 
linguistic perspective (cf. Korecky-Kroll et al. 2014 and Sommer-Lolei 
et al. this volume). In usage-based linguistic and psycholinguistic ap- 
proaches (Bybee 2001; Bauer 2001; Tomasello 2003), it is often claimed 
that token frequency is important only for the question of storage (which 
is not an issue here), whereas only type frequency and the discrepancy 
between high type frequency and low token frequency is relevant for the 
productivity and profitability of patterns (cf. Du & Zhang 2010; Berg 
2014). Here we compare type and token frequencies, in order to evaluate 
these claims with fresh data. 


1.3. BEATS-AND-BINDING MODEL OF PHONOTACTICS 


We investigate consonant clusters in the framework of the Beats-and- 
Binding phonotactic model established by Dziubalska-Kolaczyk (2002, 
2009) which is embedded in Natural Linguistics (Dziubalska-Kolaczyk 
& Weckwerth 2002) and specifically in Natural Phonology. It is a sylla- 
ble-less model, which explains the organization of consonant clusters in a 
language where beats constitute vowels (or the marked option of syllabic 
sonorants) and consonants are typically non-beats. A core of the Beats- 
and-Binding model is the Net Auditory Distance (NAD) Principle, which 
started as a modification of the Sonority Hierarchy principle (Whitney 
1865; Sievers 1876; Jespersen 1904; Ohala 1990), called the Optimal So- 
nority Distance Principle (Dziubalska-Kotaczyk 2002: 82). The present 
NAD model offers the broadest existing possibility for defining degrees 
of intersegmental cohesion (Bertinetto et al. 2006) in terms of binding 
between the beat and adjacent non-beats and between adjacent non-beats, 
including the preferredness of a cluster. 

NAD stands for the measure of auditory distances between neighbour- 
ing phonemes and allows construction of the hierarchy of preferences 
from the most to the least preferred cluster. A preference is understood 
as basically a universal preference which can be derived from more basic 
principles (Dressler 1999). A cluster is preferred if it satisfies a pattern of 
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phonetic distances in terms of the place and manner of articulation plus 
the sonority between clusters specified by the universal preference rel- 
evant for their initial, medial or final position in the word (cf. Dziubalska- 
Kotaczyk 2009, 2014). 

It is generally assumed that consonantal languages have more dispre- 
ferred consonant clusters than vocalic languages. In order to operational- 
ize this assumption and to determine the status of consonant clusters in 
German, a software package, namely the Phonotactic Calculator devel- 
oped by Dziubalska-Kolaczyk, Pietrala and Aperlinski (2014) based on 
earlier work by Grzegorz Krynicki, can be applied. The default parameter 
values of the calculator include the manner of articulation (MOA), the 
place of articulation (POA), and a hierarchy of S/O (sonorant/obstruent) 
distinctions. Due to the Phonotactic Calculator’s settings, the maximum 
number of consonant sequences to be analysed is bounded by triple clus- 
ters. Therefore, the current analysis of cluster preferredness in German is 
demonstrated based on triple consonant clusters. 

Let us present the general predictions for a triple consonant cluster 
C1C2C3V, first for the word-initial position: 

NAD (C1, C2) < NAD (C2, C3) > NAD (C3, V) 

It reads: “For word-initial triple clusters, the NAD between the third 
consonant and the second consonant should be greater than or equal to 
the NAD between this third consonant and the vowel, and greater than the 
NAD between the second and the first consonant” (Dziubalska-Kotaczyk 
2014: 5, also for the following citations). 

For the word-final position VC1C2C3 it states: 

NAD (V, C1) < NAD (C1, C2)>NAD (C2, C3) 

The condition reads: “For word-final triple clusters, the NAD between 
the first consonant and the second consonant should be greater than or 
equal to the NAD between this first consonant and the beat, and greater 
than the NAD between the second and the third consonant.” 

The condition for medial triple clusters VC1C2C3V states: 

VC1C2C3V NAD (V, C1) > NAD (Cl, C2) & NAD (C2, C3) < NAD 
(C3, V2) 

It reads: “For word-medial triple clusters, the NAD between the first 
and the second consonant should be less than or equal to the NAD be- 
tween the first consonant and the beat to which it is bound, whereas the 
NAD between the second and the third consonant should be less than 
between the third consonant and the beat to which it is bound.” 

The NAD product indicates a mean number of all the distances between 
the neighbouring phonemes in the cluster. It was introduced to the calcula- 
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tor in order to assign a preferability index which is “a number denoting a 
degree to which a given preference is observed” (Dziubalska-Kolaczyk 
2019). The formula for word-initial consonant clusters is as follows: 
NAD product = NAD C1C2 — NAD C2V 
Thus, it allows the clusters to be ordered according to their degree of 
preferability values from the most preferred to the least. 


1.4. PRINCIPLES OF NATURAL MORPHOLOGY RELEVANT FOR 
MORPHONOTACTICS 


Natural Morphology is a theory of preferences (Dressler 1999; 
Dressler & Kilani-Schoch 2017) divided into three subtheories. Of the 
first one, which accounts for universal preferences, the most relevant for 
morphonotactics are the parameters of iconicity (especially construction- 
al diagrammaticity) and transparency. In connection with the subparam- 
eter of constructional diagrammaticity, German morphonotactic conso- 
nant clusters are nearly always due to affixation, which is the most iconic 
operation, whereas anti-iconic subtraction, as in risk-ant ‘risky’, derived 
from Risiko ‘risk’, is very rare (more in section 3). High transparency 
favours morphological decomposition, which is undertaken automati- 
cally in processing: also from this perspective, affixation facilitates de- 
composition more than word-internal modification and subtraction, and 
when a consonant cluster is only morphonotactic, the morpheme bound- 
ary is more salient, which facilitates decomposition or segmentation (cf. 
Korecky-Kroll et al. 2014). Also, high morphosemantic transparency fa- 
cilitates decomposition, whereas opacity hinders it (Libben 1998; Gagné 
2009: 264-268; Hongbo, Gagné & Spalding 2011; Dressler, Ketrez & 
Kilani-Schoch 2017). For example, the relationship between Ger. Kun-st 
‘art’ and its verb base kénn-en ‘be able, can’ is both morphotactically and 
morphosemantically obscure (cf. below and section 2.2). 

Within the second subtheory, typological adequacy, German can be 
characterized as a weakly inflecting language, whose morphology is 
moderately rich (except in compounding). Thus, compounding may cre- 
ate more morphonotactic clusters than inflection or derivation. Unfortu- 
nately, we cannot investigate systematically word-internal clusters due to 
compounding because of our corpus; there is a lack of corpus linguistic 
tools for doing this semi-automatically. German is also a more suffixing 
than prefixing language. That inflectional prefixation cannot create con- 
sonantal clusters, corresponds to the type of suffixing language to which 
German belongs. 
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Within the third subtheory of system adequacy, the criterion of pro- 
ductivity (Bauer 2001; Dressler, Libben & Korecky-Kröll 2014) is very 
relevant: productive morphological rules, such as plural formation, inflec- 
tion for person and past participle formation, are liable to be involved in 
many more morphonotactic consonant clusters than unproductive rules, 
such as deverbal action/result noun formation, such as in Dien-st ‘service’ 
and Kun-st (see above). The endpoint of non-productivity is reached in 
the case of fossil morphemes, such as the prefix in Aber-glaube ‘supersti- 
tion’, where the base Glaube ‘faith’ is easy to detect. Still we can classify 
its internal triple consonant cluster /rgl/ as morphonotactic. 

Although, from a semiotic point of view morphology is more impor- 
tant than phonology for morphonotactics (Dressler 1985, 1996a), dia- 
chronic change may transform morphonotactic clusters into phonotactic 
clusters, but not vice versa (cf. Dressler et al. 2019). 


1.5. DATABASE 


The corpus linguistic research was based on the data extracted from 
the Austrian Media Corpus (AMC), which was developed at the Austrian 
Academy of Sciences (cf. Ransmayr, Morth & Matej 2017). It is consid- 
ered to be one of the largest corpus collections of the German language. It 
covers all printed resources from Austrian printed media for the last two 
decades, including the transcripts of Austrian television and broadcast 
news plus the news reports of the Austria Press Agency APA. This corpus 
contains about 40 million texts of various genres containing about 10 bil- 
lion word tokens. It is linguistically annotated with morphosyntactic in- 
formation and lemmatized. Due to its functionality, a list of all word types 
and word tokens containing the specific clusters in a given corpus can 
be selected along with the frequency of occurrence and part of speech. 
Clearly the numbers of types (inflectional word forms) given in the lists 
below refer to what is attested in the AMC; the number of potential cor- 
rect forms is higher. 

The starting point of the research was obtaining the data from the 
AMC. The corpus automatically allows identification of the position of a 
cluster, thus different queries were specified in the research. For instance, 
for the word-initial position the following query was involved “str.+”. It 
reads word-initial triple cluster /str-/ followed by one or more character. 
Thus, all consonant clusters along with their frequency of occurrence in 
the corpus were retrieved, according to their position in the word, for 
further analysis. The next stage included the elimination of all irrelevant 
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words, such as proper names, misspellings or non-words. The last stage 
of the analysis was the division of the words into three groups depending 
on whether the cluster is only morphonotactic, only phonotactic or both. 

The second analysis related to measuring auditory distances in the clus- 
ter via the NAD calculator, which was introduced in the previous section. 
All examples are written in the national German orthography. In the Ger- 
man consonantal system, a phoneme <ch> is a voiceless palatal or velar 
fricative; <sch> (and word-initial <s> before a stop) is a voiceless sibilant. 
For the NAD calculator /r/ is specified as an uvular liquid approximant. 

All clusters will be presented according to their position and each 
cluster will be exemplified by a single word, selected according to its 
high token frequency. If the number of word types occurred fewer than 
five times in the corpus, these words were eliminated from the analysis 
because most of them consisted of orthographic mistakes or they were 
non-words (especially names). 


1.6. GERMAN PHONOTACTICS 


The phonotactics of German consonant clusters has been described 
several times. Meinhold and Stock (1980: 180-188) include in their de- 
scription differences between positions and observe the influence of mor- 
phology and of phonostylistics. Hirsch-Wierzbicka (1971) aims to present 
an exhaustive overview of consonant clusters, but limited to monosy]l- 
lables. Thus, several word-initial and word-final triple and quadruple con- 
sonant clusters are missing (to some extent also for monosyllabic words). 
There are also incorrect statements about disallowed peripheral clusters. 
A classical generative account can be found in Heidolph, Flamig and 
Motsch (1981: 977—990) with the concept of the phonological structure 
conditions of morphemes (formatives) vs. words. 

Szczepaniak (2010: 107) and Fehringer (2011: 97) found specific, but 
very limited corpus-based evidence that German seems to avoid long 
word-final morphonotactic consonant groups, insofar as a rising number 
of consonants correlates with a rising preference for the masculine and 
neuter genitive allomorph -es instead of the allomorph -s. This presuppos- 
es a continuum for cluster complexity, whereas Wiese (1988, 1991, 2000; 
cf. Orzechowska & Wiese 2011, 2015) makes a sharp distinction between 
marked extrametrical consonants (the third and fourth most peripheral 
consonant of a cluster) and the other consonants of a cluster (more in sec- 
tions 2.5 and 4.2); loan words are considered to have more extrasyllabic 
consonants, i.e. more complex consonant clusters (cf. also section 3). 


I. German phonotactic vs. morphonotactic obstruent clusters 23 


2. WORD-FINAL POSITION 


In contrast to most Slavic and Romance and more conservative Indo- 
European languages, Germanic languages are rather rich in word-final 
consonant clusters, of both a phonotactic and a morphonotactic nature. 
Moreover word-final clusters are more complex and more numerous and 
more varied in types than word-initial ones. 

The morphonotactic clusters occur in the final position in 2™ SG. per- 
son and are mainly represented by 3™ SG. verb forms, superlatives or 
past participles, as shown in Dressler and Dziubalska-Kotaczyk (2006; 
cf. Dressler et al. 2010). They end with the suffixes -st (2™ SG., superla- 
tive, plus the unproductive deverbal noun-forming suffix) and -t (3 SG., 
past participle and denominal circumfixes derived from the past parti- 
ciple, ordinal-number-forming suffix). 


2.1. QUADRUPLE CLUSTERS 


All word-final quadruple clusters consist of a sonorant and 3 obstru- 
ents, the two last being always /st/. All are either only morphonotactic or 
morphonotactic by default. 

The following 20 clusters are only morphonotactic (always 2"! SG., 
sometimes also 3™ SG. or past participle): 

/-Ikst/ (5): melk-st ‘(you) milk’, ver-folg-st ‘(you) persecute’, 

/-rkst/ (30): merk-st ‘(you) notice’, borg-st ‘(you) borrow’, past par- 
ticiple ver-kork-st ‘messed up’. The only phonotactic case occurs in the 
noun Gwirkst that exists only in Austrian dialects and means ‘tricky af- 
fair’: this does not count for the standard. 

/-mpst/ (11): pump-st ‘(you) pump’, plumps-t ‘(s/he) flops’ = plumps- 
st ‘(you) flop’ (with obligatory degemination of /s+s/), 

/-mpfst/ (10): kämpf-st ‘(you) fight’, 

/-nfst/ (3): wiinsch-st ‘(you) wish’, 

/-ntfst/ (3): plantsch-st ‘(you) splash’, recent English loan words 
launch-st, lunch-st. In oral speech, the /s/ is most often reduced after /f, tf 
/ when followed by /t/. 

/-lfst/ (3): hilf-st ‘(you) help’, 

/-rfst/(65): darf-st ‘(you) may’, nerv-st ‘(you) enervate’, 

/-rmst/ (29): form-st ‘(you) form’. 

/-Imst/ (8): film-st ‘(you) film’, 

/-lxst/ (2): strolch-st ‘(you) roam about’, 

/-rxst/ (11): schnarch-st ‘(you) snore’, 
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/-ftsst/ (2): seufz-st ‘(you) sigh’: normally the /s/ is fused with the 
preceding affricate, 

/- xtsst (3): 4chz-st ‘(you) groan’ (same fusion), 

/-rtsst/ (2): stiirz-st ‘(you) fall’ (same fusion), 

/-Lfst/ (2): falsch-st *(you) falsify’, feilsch-st ‘(you) haggle’, 

/-Itsst/ (1): salz-st ‘(you) salt’ (same fusion; 4 others potential, but not 
attested). 

The following clusters are Gen.SG. of isolated masculine and neuter 
nouns: 

/-yksts/ (1): Hengst-s ‘stallion’ (masc.), 

/-rpsts/ (1): Herbst-s ‘autumn’ (masc.), plus its numerous compounds, 

/-Ipsts/ (1): Selbst-s ‘the self’ (neuter), 

/-rnsts/ (1): Ernst-s ‘earnestness’ (masc.), plus its numerous 
compounds. 

The four following quadruple clusters are morphonotactic only as a 
strong default: 

/-nkst/ as in denk-st ‘(you) think’ and in a variant pronunciation of 
-ngst, aS in sing-st ‘(you) sing’, superlatives jiing-st ‘recently’, the 
morphosemantically somewhat opaque adverb /dng-st ‘for a long time’ 
(closely related to the transparent superlative der/die/das ldng-st-e ‘the 
longest’). However, there are two phonotactic exceptions: the nouns 
Angst ‘fear’ and Hengst ‘stallion’. 

/-rpst/ occurs as a morphonotactic cluster in 2"! SG. verb forms in 
stirb-st ‘(you) die’, wirb-st ‘(you) advertise’ (and their preterits). The 
only phonotactic exception is Herbst ‘autumn’ and compounds thereof 
(with diachronic loss of a schwa, cognate with Engl. harvest). 

/-Ipst/is only morphonotactic in stülp-st ‘(you) turn up (the collar)’ and 
riilps-t ‘(s)he burps’ = 2" SG., Part. ge-riilps-t. The transitional exception 
is selb-st ‘oneself’ with a fossil suffix, related to der/die/das-selb-e ‘the 
same’. 

/-rnst/ occurs as a morphonotactic cluster in 2"! SG forms, as in lern- 
st ‘(you) learn’, and as phonotactic only in the adj. ernst ‘earnest’ and its 
conversion into a noun. 

Table 1 presents for each cluster the number of word types, its token 
frequency in the corpus and the type-token ratio. Since the NAD calcula- 
tor is not able to measure all the distances within the quadruple clusters, 
no preferences can be deduced, but we chose the type-token ratio (TTR) 
calculation in order to arrive at some generalizations about the morpho- 
notactic vs. phonotactic distribution of these clusters: 
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Table 1. Distribution of word-final quadruples 
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The type-token ratio is the most commonly used index of lexical di- 
versity of a text, i.e. the number of tokens divided by the number of word 
types (McEnerny & Hardie 2012), which allows us to analyse the lexical 
variation of vocabulary containing a specific cluster in the corpus. 

It can be observed that: 1) the overall number of tokens increases along 
with the number of word types); 2) the growth of tokens is exponential. 
Thus, relying on the data from the AMC corpus, it can be concluded that 
for word-final quadruple clusters the number of occurrences is in direct 
relation to the type frequency. Although there are also some other excep- 


26 | 
o vase [5 | oor 
«er 
a 


NINI NIUA 


— | — | — | — 


26 W. U. Dressler, A. Kononenko-Szoszkiewicz 


tions, there is a group of clusters /-lkst, -nfst, -lpst/ which consist of a so- 
norant followed by an obstruent plus /st/. They are relatively rare in types, 
nevertheless they have a high token frequency in the corpus. 

Based on the TTR, the groups of word-final quadruple clusters can be 
clearly distinguished according to 3 intervals: 1) 14 with a TTR between 
0.02 and 1.29%; 2) 3 with a TTR between 10.38 and 11.38%; 3) for 4 
clusters the TTR is exactly 100%. In addition, there are 2 with a TTR 
of 22.22%, 1 at 4.35% and 1 with a TTR of 50%. The TTR in /-rpst/ 
is the lowest, which means that there are very few words of very high 
frequency, e.g. Herbst ‘autumn’ is the most frequent word with the final 
cluster /-rpst/ in the corpus, the frequency of occurrences being due to a 
great number of compounds ending in Herbst. The second group consists 
of /-rkst, -rmst, -mpfst/, again due to the fact that there are rather few 
words that occur frequently. Finally, the TTR reaches 100% in the third 
group, where two words have just one form and two others two forms in 
the corpus. All clusters which are morphonotactic only as a strong default 
are in the first, the largest group. 

The highest type and token frequency of /-rpst/ is due to the richness 
and productivity of German compounding which leads to the high occur- 
rence of morphonotactic clusters in compounds with the final element 
Herbst ‘autumn’. Thus, the TTR is by far the lowest of all the quadruple 
clusters. The next lowest TTR occurs in /-nkst/ which is the only quadru- 
ple cluster that includes a phonotactic cluster, i.e. in Hengst ‘stallion’ and 
its numerous compounds. Something similar to compounding takes place 
in productive particle word formation. But this pattern generates final verb 
clusters only in secondary clauses such as Wenn du den Schal um-hdng-st 
‘if you put the scarf around (your neck)’, and therefore the token frequen- 
cy of such word-final morphonotactic clusters is very restricted and thus 
cannot compete with the number of phonotactic clusters in compounds. 

Thus, the type-token ratio proves to be a far better distinguisher of 
quantitatively similar groups than the type or token frequency. 


2.2. TRIPLE CLUSTERS ENDING IN -T 


As expected, triple obstruent clusters are more numerous and varied 
than quadruple clusters. Not all of them, but nearly all start with a so- 
norant. In addition to the two final obstruents /st/ we also find /ft/ and 
combinations of all existing obstruents with final /s/, of course excluding 
prefinal /s/ due to degemination of /s+s/ and prefinal /d, t/ because of the 
fusion of the dental stop and /s/ to an affricate /ts/. Due to such fusion, 
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genitives ending in /ts/ also exist, such as des Punkt-s ‘of the point’. We 
exclude from our investigation triple clusters consisting of 2 sonorants 
and 1 obstruent, such as /-Imt, -Int, -rnt/. 

The exclusively morphonotactic triple clusters are 24 in number, i.e. 
13 more clusters than the morphonotactic quadruple clusters: 

/-xst/: lach-st ‘(you) laugh’, superlative héch-st ‘most highly’, 

/-xtst/: 3! SG. achz-t ‘groans’ and its participles, 

/-fst/: schaff-st ‘(you) create’, adverb zu-tief-st ‘deepest’, nerv-st 
‘(you) get on nerves’, 

/-mst/: trdum-st ‘(you) dream’, bums-t ‘(s/he/you) bump(s)’ and its 
participle, spar-sam-st ‘most thriftily’, 

/-{st/: wisch-st ‘(you) wipe’, 

/-pfst/: klopf-st ‘(you) knock’, 

/-tfst/ : rutsch-st ‘(you) slip’, 

/-fist/: only in seufz-t ‘(s)he sighs’ (and in the reduced 2™ person, see 
above, similarly in the following examples), and in the participle ge- 
seufz-t, and its derived verbs, 

/-lft/: hilf-t ‘helps’, in weak past participles (e.g. ge-golf-t ‘golfed’), 
and in e/f-t, zwolf-t ‘eleventh, twelfth’, 

/-Ixt/: 3™ SG. and past participle er-dolch-t ‘stabbed’ 

/- Itst/: walz-t ‘(s)he waltzes’ and its participle, 

/-ntst/: tanz-t ‘(s)he dances’ and its participle, ver-wanz-t ‘bug- 
infested’, a circumfixation of Wanze ‘bug’, 

/-lft/: only in fälsch-t ‘(s)he falsifies’ and its participle and derived 
verbs, 

/-mft/: only in ramsch-t ‘(s)he buys cheap junk’ and its participle and 
derived verbs, 

/-rt{t/ only in turtsch-t ‘taps (eggs)’ and its participle, 

/-nft/ : wuinsch-t ‘(s)he wishes’ and its participle, 

/-pft/: grapsch-t ‘grabs’ and its past participle, 

/-rft/: forsch-t ‘(s)he researches’ and its participle, 

/-nt{t/: plantsch-t ‘(s)he splashes’ and its participle. 

The following examples can never be the 2 SG. (due to the 
phonological reduction of -s): 

/-nxt/ in the only verb ttinch-t ‘whitewashes’, its participles and its 
derivation into a particle verb, 

/-lkt/: melk-t ‘(s)he milks’, folg-t ‘(s)he follows’ and their participles, 

/-mpft/: kampf-t ‘(s)he fights’ and its participle, 

/-mpt/: pump-t ‘(s)he pumps’, bomb-t ‘(s)he bombs’ and their 
participles, 
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/-rpt/: zirp-t ‘(s)he chirps’ and its participle, stirb-t ‘(s)he dies’ 

/-lpt/: ttilp-t ‘(s)he turns up’ and wo/b-t ‘curves’ and their participles. 

There are just 2 clusters which are morphonotactic as a strong default 
(if we take 75% of types as the criterion): 

/-\st/: will-st ‘(you) want’, puls-t ‘(s)he pulses’ (and 2" SG.) and 
its participle, adv. schnell-st ‘most rapidly’, but clearly phonotactic in 
Wulst ‘bulge’ and its compounds. Doubtful are Schwul(-)st ‘bombast’ 
and Ge-schwul(-)st ‘tumour’, because most people can relate it to the 
base verb schwell-en ‘swell’. But this relation may be classified as rather 
metalinguistic; there is as yet no evidence that it would be active in 
processing (e.g. priming) experiments. 

/-rtst/ as in schmerz-t ‘it hurts’ (also 2"! SG. schmerz-st) and its 
participle, but a unique phonotactic instance in Arzt ‘physician’ and its 
many compounds. 

The following clusters are ambiguous with either a morphonotactic or 
a phonotactic majority: 

/-nst/ as in dien-st ‘(you) serve’ and in the homophonous noun Dien-st 
‘service’ with an unproductive deverbal nominalization suffix, grins-t ‘(s) 
he grins’ (plus 2™ SG.) and its participle, adv. fein-st ‘in the finest way’. The 
cluster is clearly phonotactic in ernst ‘earnest’, sonst ‘otherwise’, Wanst 
‘paunch’. We should also add earlier derivations such as Kunst ‘art’ which 
many relate metalinguistically, against furious artist’s opposition, to the verb 
konn-en ‘to be able’; Gunst ‘favour’, which few relate metalinguistically 
to the etymologically cognate verb génn-en ‘not begrudge smth to smbd’; 
similarly Brunst ‘sexual heat’ to brenn-en ‘burn’. In terms of types 
(excluding compounds), the cluster /-nst/ might be called morphonotactic 
by default, but the 1,993 compounds with the second element -kunst render 
the global type and token frequency of phonotactic clusters the majority. 

/-rst/ is morphonotactic in cases such as war-st ‘(you) were’, the su- 
perlative adverb schwer-st ‘heaviest’, isolated mors-t ‘(s/he/you) send 
in Morse’ and its participle vs. phonotactic Wurst ‘sausage’, Forst ‘for- 
est’, Durst ‘thirst’, erst ‘first’ (which, like its English correspondent, was 
originally a superlative), but most types occur in compounds. Ober(-) 
st ‘colonel’ is thoroughly lexicalized (morphosemantically opaque), but 
clearly related to the superlative der ober-ste ‘the highest’. When exclud- 
ing compounds, the types are morphonotactic by default. 

/-pst/ is morphonotactic in cases such as tipp-st ‘(you) type’, lieb-st 
‘(you) love’, pieps-t ‘(s)he peeps’ (also 2™ SG. and particple ge-pieps-t), 
superlative (or, more precisely, excessive) adverb herz+aller-lieb-st 
‘wholeheartedly dearest’, phonotactic in Papst ‘pope’, Obst ‘fruits’, Probst 
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‘provost’. Again, this cluster can be considered to be morphonotactic 
by default, when excluding compounds, but the abundant metaphoric 
compounds of Papst make the global type frequency and token frequency 
of phonotactic clusters majoritarian. 

/-rkt/ occurs as a morphonotactic cluster in merk-t ‘(s)he notices’ 
sorg-t ‘(s)he cares’ and their participles, but as a phonotactic cluster in 
Markt ‘market’, Infarkt ‘infarct’ and their numerous compounds. Without 
these the cluster is morphonotactic by default. 

/-ykt/ (written with also -ngt) is morphonotactic by default as in 
bring-t ‘(s)he brings’, if one excludes the noun Punkt ‘point, dot’ with 
its numerous compounds, again as the richness of German compounding 
type and token frequency hides the basic default. Another noun with the 
phonotactic cluster is /nstinkt. 

/-rxt/ (phonetically [rçt]) is similarly morphonotactic by default, as in 
ge-pferch-t ‘crammed’, with the only phonotactic cluster in Furcht ‘fear’ 
and its numerous compounds. 

/-rft/ is similarly morphonotactic by default, as in wirf-t ‘throws’ and 
nery-t ‘enervates’, with the phonotactic exceptions Werft ‘wharf’ with 
its many compounds and Notdurft ‘need’ (where the earlier morpheme 
boundary before nominalizing ¢ is obsolete). 

/-nft/ is the only cluster of this subgroup which is phonotactic by 
default, as in sanft ‘mild’ (Austrian variant Senft ‘mustard’ with a 
secondarily attached final /t/). The only morphonotactic exception is the 
ordinal number fünf-t ‘fifth’, whereas it is improbable that an analogous 
morpheme boundary is processed in Brunft ‘rut (of deer)’, historically 
derived from brenn-en ‘to burn’, because of its morphotactic and 
morphosemantic opacity, and with most nouns analogously derived from 
particle verbs with the verbal base komm-en ‘come’, such as Zukunft, 
Hinkunft ‘future’ vs. zukommen ‘approach, belong’. 

/-kst/ (also written -chst, -ckst, -gst, -xt) is morphonotactic by default, 
as in wdchs-t ‘grows’ (also in the 2™ singular weck-st ‘(s)he awakes’), 
the only phonotactic exceptions are Text ‘text’? and Axt ‘axe’ with their 
numerous compounds. 

There are no other word-final triple consonant clusters with 2 final 
obstruents, unless in foreign names, such as Minsk, Kursk. Other com- 
parable triple clusters with final -t do not occur, because conceivable and 
pronounceable clusters such as -skt, -spt do not occur as phonotactic clus- 
ters and, in contrast to English, they are excluded as morphonotactic clus- 
ters, because no verb roots (nor nouns) ending in -sk, -sp exist in German. 
Adjectives ending in -sk do not form a superlative in -sk+st, but insert 
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an -e- before the superlative suffix. Other fricatives have a still smaller 
phonotactic distribution than /s/. 

Thus, all word-final triple clusters, which contain two obstruents are 
morphonotactic (only exception: those in -n/ft), because phonotactic clus- 
ters either do not occur or only occur as the exceptions when counted in 
lemmas. But their type and token number may be competitive with mor- 
phonotactic ones due to compounding. Many of the lemmas with final 
phonotactic clusters go back to derivations with a morphonotactic cluster. 

As expected, morphonotactic clusters ending in the longer suffix -st 
have fewer phonotactic counterparts than morphonotactic clusters ending 
in the shorter suffix -t. 

Turning to a NAD analysis of triple final clusters ending in /t/, we start 
with the presentation of the frequency demonstrated in Table 2: 


Table 2. Frequency ranks of word-final triples 


Tokens TTR (%) 

9,831,812 0.063 

5,487,640 0.1 

2,601,645 0.06 
5,649,995 0.02 
0.09 

0.02 

0.4 

597,052 0.05 

560,076 0.05 

246 1,838,731 0.01 

232 0.03 

0.14 

0.01 

0.03 

0.29 

0.15 

0.01 

0.12 
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In contrast to quadruple clusters, triple clusters do not form several 
neatly separated groups according to the TTR: the TTR of just 4 clusters 
is clearly above 1%, one amounts to 20.7% and only one has a TTR of 
100%. None of the triple clusters hast just 1 type. 

The NAD phonotactic calculator establishes the preferences of the 
clusters (structure VCCC) as presented in Table 3: 


Table 3. Preference rankings of word-final triples according to NAD? 
IPA tran- NAD NAD NAD NAD prod- | Preferred 
scription (VC) (C1C2) (C2C3) uct cluster? 
Ver | 2 | és | 1i | s | 


? Three clusters /-ntft/, /-tfst/ and /rtft/ were excluded from the analysis because the NAD 
calculator does not recognize affricate /-t{/. Therefore, they were counted manually. 
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From Table 3 the following conclusions can be drawn: 

The majority of preferred clusters start with a rhotic, lateral or nasal 
sonorant followed by two obstruents or another sonorant. The most sig- 
nificant distance between the neighbouring phonemes is always greatest 
when it starts with a rhotic or lateral sonorant, for instance the NAD prod- 
uct of /rpt/ is 5.1 and the NAD product of /rtst/ is 3.85. 
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Out of 33 word-final consonant clusters, 19 clusters are preferred and 
14 dispreferred. If we add the 3 clusters that the NAD calculator could not 
handle, then we obtain 19 preferred clusters and 17 dispreferred clusters. 

However, there is the question of whether similar predictions can be 
deduced in a simpler process of calculation. Since the NAD calculator 
is the most elaborate tool for deducing the predictions on the degrees 
of markedness for (mor)phonotactic clusters so far, it is worth trying to 
modify the method of NAD calculation. 

Thus, we applied a factor analysis in order to test whether there is a 
correlation among the variables which were previously obtained in the 
present research. For the factor analysis, 30 word-final consonant clusters 
were selected and 7 independent variables. The first and second variables 
are the number of the word types and tokens from the AMC for each 
cluster followed by the auditory distances between the neighbouring pho- 
nemes according to the NAD calculator. The next two variables represent 
the information whether the cluster is preferred or dispreferred and the 
division between phonotactic vs. morphonotactic (Phon/morph) clusters 
as presented in Table 4. 


Table 4. Factor analysis for word final triple consonant cluster 


Factor loadings (Varimax normalized) 
Extraction : Principal components 
(Marked loadings are > .700000) 


Variables Factor (1) Factor (2) Factor (3) 
Types -0.024865 -0.924373 0.073853 
Tokens -0.142575 -0.849302 0.216074 


NAD (VC) -0.832629 -0.042204 0.403198 
NAD (C1C2) -0.114682 -0.090996 
NAD (C2C3) 0.051592 0.071495 -0.966928 
Preferences 0.024393 0.194894 


Expl. var 2.481534 2.129518 1.236691 
Prop. of total. var 0.354505 0.304217 0.176670 


Numbers in bold indicate a significant correlation among the vari- 
ables. For instance, in Factor (1) we may observe a certain correlation 
between NAD (VC) and NAD (C1C2). The possible explanation is that if 
we look at the NAD table of all 30 clusters, we can see that the measures 
of NAD (VC) and (C1C2) are inversely proportional to each other in most 
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of the cases. For instance, if the NAD (VC) is high then the NAD (C1C2) 
will be smaller. For example, in the word-final cluster Vfst the NAD (VC) 
is equal to 5 and the NAD (C1C2) is 0.5. And conversely, if we take the 
cluster Vrpt, where the NAD (C1C2) is equal to 6.6 and NAD (VC)=2. 

The next observation is that cluster preferredness is related to the NAD 
(VC) and the NAD (C1C2). In general, if the NAD (C1C2) is higher than 
the NAD (VC), then the cluster is more likely to be preferred. This cor- 
responds entirely to the NAD formula for triple finals shown above. 

From Factor (2) we can see that there is a certain correlation between 
word types and tokens. They are connected in the same direction, so we 
could assume that if the number of word types grows, then the frequency 
grows as well. 

For Factor (3) we can observe that the NAD (C2C3) is not connected 
to any of the variables, but it is still significant, presumably to other vari- 
ables not yet discussed. 

Most notably, the factor analysis has shown that the NAD (C2C3) is 
not related to the NAD (VC) or the NAD (C1C2), which goes against a 
well-established NAD formula for predicting the preferredness for word- 
final triple clusters. Therefore, one assumption that can be inferred is that 
the NAD distances of two phonemes in the cluster, namely the NAD (VC) 
and the NAD (C1C2) might be enough to decide on the preferredness 
of word-final clusters in German. However, more research on consonant 
clusters in different word positions as well as of different languages is 
needed in order to corroborate this statement. For that reason, we have 
compared the cluster preferredness of German, English and Polish in the 
word-initial and word-final positions via the NAD calculator when the 
most peripheral consonants were excluded from the analysis. The results 
are discussed in section 4.2. 

If we compare the preference predictions in Table 3 or just compare its 
third and fourth columns, where the NAD (C1C2) should be bigger than 
the NAD (VC), and if we split Table 2 into two based on the frequency 
ranking, putting 18 clusters into the first half and 18 into the second, then 
we find 11 preferred and 7 dispreferred clusters within the first group, and 
10 preferred and 8 dispreferred clusters in the second half. This is a posi- 
tive, i.e. supportive, but not a significant difference. With regard to the 
claim that phonotactic clusters are more preferred than morphonotactic 
clusters, we found that among the exclusively morphonotactic clusters, 
14 are preferred and 11 dispreferred, whereas among those clusters which 
are both morphonotactic and phonotactic, 7 are preferred and 4 dispre- 
ferred. This is again a positive but not a significant difference. 
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Moreover, all (but one) of the word-initial triple clusters, which are all 
exclusively phonotactic, are preferred clusters. And this seems to repre- 
sent a very significant difference from the mainly morphonotactic word- 
final clusters. However, the triple final clusters ending in -s (discussed 
in the following section 2.3) are all exclusively morphonotactic and all 
preferred clusters. 


2.3. TRIPLE CLUSTERS ENDING IN -S 


A further source of word-final morphonotactic obstruent groups is 
the nominal -s Gen.SG., less commonly the homophonous plural suffix 
as in Kalb-s ‘calf’, (also plural), Korb-s ‘basket’, Ge-zirp-s ‘chirping’, 
Schilf-s ‘reed’, Dorf-s ‘village’, Nerv-s ‘nerve’, Talg-s ‘tallow’. Parallel 
phonotactic clusters occur in Rülps ‘belch’ and Mumps. Similar 
morphonotactic clusters arise through the suffixation of plural -s, as in 
Gen.SG. and PL Tank-s, Skalp-s ‘scalp’, Ulk-s ‘trick’, and adverbial -s, as 
in aller-ding-s ‘indeed’. 

Word-final, exclusively morphonotactic, triple clusters with /s/ at the 
end are the following (all Gen.SG., if also plurals, then explicitly noted): 

/-rps/: Bewerb-s ‘competition’, Korb-s ‘basket’ and their numerous 
compounds, 

/-rfs/: Dorf-s ‘village’, Wurf-s ‘throwing’ and Nerv-s ‘nerve’ and their 
numerous compounds, 

/-rks/ as in Gen.SG. Bezirk-s ‘district’, Gen.SG. and PL of recent 
English loan-words, such as Park-s. A phonotactic exception is Murks 
‘botch’, 

/-rxs/: Monarch-s with a few compounds, 

/-rfs : Hirsch-s ‘stag’, 

/-lfs/: Wolf-s ‘wolf’, 

/-lks/: Erfolg-s ‘success’, Volk-s ‘people, folk’, 

/-lxs/: Elch-s ‘elk’ with several compounds, 

/-nks/: also PL in the English loan word Song-s, only adverb link-s ‘to 
the left,’ 

/-nfs/: Wunsch-s ‘wish’ with a few compounds, 

/-nxs/: only Monch-s ‘monk’ with its many compounds. 

/-ntf{s/: only in English loan words, e.g. Brunch-s (more than 60% 
plurals, less than 40% Gen.SG. in the average), 

/-mps/: only in English loan words (also PL), e.g. Vamp-s; a phonotactic 
exception is the loan word Mumps, 

/-Ips/ occurs only in Kalb-s ‘calf’ and in the loan word (also PL) 
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Skalp-s ‘scalp’ and their compounds; a phonotactic exception is the 
onomatopoeic Riilps ‘belch’, 

/-mpfs/: Kampf-s ‘fight’ and its compounds, 

/-mfs/ only in Ramsch-s ‘junk’, 

/-sks/ only in loan words (also PL), e.g. Disk-s. 

The frequency ranking of these clusters is presented in Table 5 


Table 5. Frequency ranks of triple clusters ending in -s 


5,608,107 0.18% 
858,787 0.51% 
189,687 0.62% 
1.23% 

0.66% 
0.4% 

0.56% 

0.01% 

0.21% 

2.24% 
1.35% 

3.45% 

1.14% 

16.7% 


The spread of the TTR is similar to the triple clusters ending in /t/, but 
there is one cluster with only one type. 

The preferences established by the NAD calculator for VCCC clusters 
are the following (see Table 6): 


Table 6. Preference rankings of word-final triples ending on -s according to NAD 


IPA NAD NAD Preferred 
transcription (VC) (C1C2) an sacs cluster? 

Vrps 2 6.6 Yes 

Vrfs 2 5.1 0.5 3.85 Yes 


Viks 25 23 24 Yes 


Vlps 2.5 4.5 2 2.25 Yes 
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Thus, all triple clusters ending in -s are preferred clusters, although all 
of them are exclusively morphonotactic, two of them with a marginal 
phonotactic exception. 

Also, there are several morphonotactic double final morphonotactic 
consonant clusters with an affricate /ts/, due to Gen.SG. and rarely PL 
-s: /xts/ as in Berichts ‘report’, /kts/ as in Projekts ‘project’, /pts/ as in 
Konzepts ‘concept’, /Its/ as in Anwalts ‘lawyer’, /nts/ as in Abends ‘in 
the evening’, and /rts/ ‘as in Jahrhunderts ‘century’. The only phonotac- 
tic correspondents are words such as Holz ‘wood’, Tanz ‘dance’, Scherz 
‘joke’, i.e. if a sonorant precedes an affricate. 

A problem is represented by imperatives of the type knicks! ‘curtsey!’, 
schubs! ‘push!’. First, it is unclear whether the word-final -s is synchron- 
ically still a derivational suffix. Second, even if not, it is unclear whether 
such imperatives are to be classified as base forms (if yes, then phonotac- 
tic) or as morphologically derived from the infinitive as a lexical entry. 


2.4. TRIPLE CLUSTERS ENDING IN -7S_ 


The masculine and neuter Gen.SG, -s (potentially, also of the homoph- 
onous plural suffix, but actually only in a single cluster) is the source 
of nearly always morphonotactic clusters ending in the affricate -ts due 
to fusion of the inflectional suffix with a stem-final dental stop (for fre- 
quency ranks see Table 7): 

/-rsts/: Durst-s ‘thirst’, 

/-Ists/: Schwulst-s ‘bombast’, 

/-psts/ : Papst-s ‘pope’, Herbst-s ‘autumn’ and their many compounds, 

/-nsts/: Dienst-s ‘service’ and its many compounds, 

/-rkts/: Markt-s ‘market’ and its many compounds, 

/-nkts/: Punkt-s ‘point’ and its many compounds, 

/-nfis/: Senft-s ‘mustard’, 

/-rpts/ only in Exzerpt-s ‘excerpt’, 

/-tsts/ only in Arzt-s ‘physician’ with its many compounds, 

/-ksts/ only in 7ext-s and its compounds 
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Table 7. Frequency ranks of triple clusters ending in -ts 


0.33% 
0.16% 
1.29% 
0.04% 
0.003% 
0.66% 
0.05% 
0.24% 
1.43% 


Here we have no groupings of clusters according to TTR, but there are 
three clusters with just one type. Again, all clusters are preferred accord- 
ing to the NAD calculator, although all of them are exclusively morpho- 
notactic. 


2.5. WORD-INITIAL POSITION 


The German standard has no monoconsonantal prefixes, in contrast to 
Bavarian-Austrian dialects, as in g storben ‘died’, b soffen ‘drunk’, z’ruck 
‘back(wards)’ etc., corresponding to Standard German ge-storb-en, be- 
soff-en, zu(-)rtick. Thus, the German standard is rather poor in word-ini- 
tial clusters, all word-initial clusters are exclusively phonotactic. Some of 
the more dispreferred ones occur only in loan words from Ancient Greek 
and their derivations, e.g. /mn-/. German phonotactic initial double clus- 
ters were partially studied in Dziubalska-Kolaczyk (2002) with regard to 
universal phonotactic preferences. Moreover, double obstruent clusters 
serve as a basis for the complexity of triple initial clusters. 

Phonotactic preferences for word-initial clusters in German have been 
studied by Orzechowska and Wiese (2011, 2015). They proposed an alter- 
native approach to the NAD which is not limited to the size of the cluster 
and is not based on a sonority hierarchy but on an empirical analysis 
of features. The analysis of German initial clusters was based on 15 pa- 
rameters, which included different values such as the cluster complexity, 
place of articulation, manner of articulation and voicing, in order to build 
a quantitative ranking of all clusters in terms of adherence to the prefer- 
ences established by the Sonority Sequencing Generalization. This last 
approach will not be followed here. 
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For our study, the most interesting word-initial double clusters consist 
of two obstruents, particularly with a fricative in first position and a stop 
in second position: /{t-/ as in statt ‘instead of’ and /{p-/ as in spielen ‘to 
play’. Words of foreign origin can also start with /sk-/ as in skeptisch 
‘sceptical’, /sp-/ as in Spatium ‘space’, /sts-/as in szenisch ‘scenic’, iso- 
lated /xt-/ as in chthonisch ‘chthonic’, and /ft-/ as in Phthisis ‘wastage’. 

A fricative is followed by another fricative, or rather approximant, in 
/fv-/ as in schwer ‘heavy’, or in loan words in /sv-/ as in Sweater, /sf-/ as 
in spharisch ‘spherical’, or /sx-/ as in Schizophrenie ‘schizophrenia’, and 
by an affricate in /tsv-/ as in zwei ‘two’. 

An obstruent is followed by a sonorant, first as a fricative, as in sch- 
reiben ‘to write’, /{m-/ as in schmecken ‘to taste’, /{n-/ as in schneiden ‘to 
cut’, /fl-/ as in schlieBen ‘to close’, /fl-/ as in flach ‘flat’, /fr-/ as in fragen 
‘to ask’, /vr-/ as in Wrack ‘wreck’, only in loan words /sm-/ as in Sma- 
ragd ‘emerald’, /xr-/ only in the isolated learned loan word Chrie ‘school 
theme’, (/vl-/ only in foreign names such as Vladimir, Wladiwostok). 

A stop is followed by a sonorant in /gr-/ as in grof ‘large’, /gl-/ as in 
glücklich ‘happy’, /gn-/ as in gnadenlos ‘merciless’, /kl-/ as in Kleid ‘dress’, 
/kr-/ krank ‘sick’, /kn-/ as in Knie ‘knee’, /bl-/ as in bleiben ‘to stay’, /br-/ as 
in brechen ‘to break’, /pl-/ as in plump ‘clumsy’, /pr-/ as in Pracht ‘splen- 
dour’, /dr-/ as in drei ‘three’, /tr-/ as in tragen ‘to wear’. An affricate is the 
first obstruent in pfl as in pflegen ‘to care for’, /pft-/ as in pfropfen ‘to graft’. 

A stop is followed by a fricative in words of foreign origin in /ks-/ as 
in Xenophobie ‘xenophobia’ or /ps-/ as in psychisch ‘psychological’. A 
stop 1s followed by the fricative or approximant /v/ in /kv-/as in Quelle 
‘source’, or by an affricate in /tsv-/ as in Zwang ‘coercion’. 

A sequence of word-initial stops is limited to words of Ancient Greek 
origin: /pt-/ as in Pteridin ‘pteridine’, /kt-/ as in ktenoid ‘ctenoid’. 

The majority of double clusters that do not occur only in learned words 
of foreign origin respect the preferences of the Beats-and-Binding-Model 
(Dziubalska-Kolaczyk 2002: 112). 

In this contribution, we stick to the longer clusters with the maximum 
number of consonants in the onset, which is three. There are eight types 
of triple initial consonant clusters in German (see Table 8). All of them 
consist of two obstruents plus a sonorant or approximant: /ftr-/ as in streng 
‘strict’, /{pr-/ as in spricht ‘s/he speaks’, /fpl-/ as in Splitter ‘splinter’; 
next in words of foreign origin /skr-/ as in skrupellos ‘ruthless’, /skl-/ as 
in sklavisch, adjective of ‘slave’. In more recent loan words we find also / 
sky-/ as in Squaw (the only integrated loan word with this cluster, with the 
possible exception of squash), /spr-/ as in Sprinter and /spl-/ as in Spleen. 


40 W. U. Dressler, A. Kononenko-Szoszkiewicz 


Table 8. Frequency ranks of triple word-initial clusters 


skl 221 7 
Spl 104 1.7 
spl 97 1.61 


These triple clusters also exhibit no grouping according to TTR; only 
one cluster has just one type. 


Table 9. Preference rankings of word-initial triples according to NAD 


scription c 1 io ion a uct cluster? 
o2 | é | 2 | 40 | Ys | 
23 4.8 25 2.4 es 


Yi 


Table 9 presents the NAD analysis of these clusters and the quantifica- 
tion of rising preferences. For word-initial consonant clusters we under- 
took an analogous factor analysis as for the word-final consonant clusters 
in section 2.2. When eliminating the first consonant, the two remaining 
NAD distances, NAD (C2C3) and NAD (CV), again showed the same 
preferences as when including the first consonant, i.e. we arrived at the 
same result as in section 2.2. 

In conclusion we can see that: 

1) All word-initial triple clusters consist of initial double obstruent 
clusters of a s(h)ibilant plus a stop followed by a rhotic or lateral sonorant 
or the fricative/approximant /v/. Other double clusters which occur in the 
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word-initial position, i.e. /bl, br, gr, gl, gn, gm, dr, xr, xt, kn, pfl, pfr, fl fv, 
Jfr, fm, fn, ps, sf, sm, sts, tsw/ cannot be part of a word-initial triple cluster, 
except for extragrammatic words such as the interjection pst, which has 
the further irregularity of containing a syllabic fricative. 

2) There is a moderate correlation between the degree of preferredness 
and the frequency in the AMC: the most preferred cluster is /fpr/, which 
has the highest token frequency and the second-highest type frequency; 
the next cluster in the hierarchy of preferences is /ftr/, which has the 
highest type frequency and the second-highest token frequency. The 
other three clusters differ little in preferredness and their frequency ranks 
decrease in parallel for types and tokens. The reason for the mismatch 
between the type and token frequency differences of /ftr/ and /fpr/ is on 
the one hand historical, insofar as they go back to the earlier clusters / 
str/ and /spr/, the only word-initial triple consonant clusters reconstructed 
with some certainty for Proto-Indo-European (Oppermann 2004). On the 
other hand, the general phonotactic preference for /{pr/ may have had a 
positive impact on its token frequency. The only dispreferred cluster /skv/ 
is rare and occurs only in one word type (or two). 


3. WORD-INTERNAL POSITION 


Word-internal clusters are presented only briefly and selectively for 
the following reasons: first of all, word-medial consonant clusters are 
much more varied and complex than initial and final ones, so that an 
equally extensive study would exceed space limits. Second, the corpus 
linguistic tools of the AMC do not permit the same procedures of analysis 
as for initial and final clusters. Third, the NAD calculator cannot pre- 
dict preferences for the many complex clusters of more than three con- 
sonants. Fourth, internal clusters are psycholinguistically less important 
than peripheral clusters due to the bathtub effect, which renders the pe- 
riphery of a unit better perceivable than its interior (Aitchison 2003: 138). 
Therefore, we limit our discussion to observations of general differences 
between morphonotactic and phonotactic consonant clusters and their ex- 
planations. 

It holds for phonotactic clusters that word-internal syllable onsets al- 
ways follow the pattern of word-initial onsets. In compounding and deri- 
vation, the syllable boundary always follows the morpheme boundary in 
consonant clusters. 

In a word-internal position, there is a much greater variety of conso- 
nant clusters than in the peripheral positions. Phonotactic clusters that 
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occur only word-internally have an internal syllable boundary, but they 
are rather few, such as /fk, dl, dv/ as in the plant name Levkoje, in Adler 
‘eagle’, where a vowel has been lost, and Advent ‘advent’, where a mor- 
pheme boundary has been lost, and /tl/ as in the loan word Atlas. There 
are a few triconsonantal phonotactic clusters, such as /ktr, ltr, mpl, rtsn, 
stm, / as in the loan words Spektrum, Altruismus ‘altruism’, Amplitude 
‘amplitude’, Arznei ‘medicine’, Asthma, thus hardly any with two ob- 
Struents. 

The bulk of new word-internal consonant clusters are morphonotac- 
tic due to the addition of morpheme-initial to morpheme-final clusters 
in compounding and affixation. This often creates morphonotactic clus- 
ters which are disallowed word-initially or word-finally and may contain 
more consonants than are permitted in the word periphery. Examples are 
the compound Herbst+pflanze ‘autumn plant’ and the suffixation herbst- 
lich ‘autumnal’, as well as the prefixation ent-springen ‘originate’. In 
compounding, interfixation may either break up (by the interfix -e-) or 
increase (by the much more frequent interfix -s-) the sequence of conso- 
nants as in Weg+e+lagerer ‘highwayman’ and Koénig+s+schloss ‘royal 
castle’. The syllable boundary is always after the interfix, which fits with 
the fact that the main morpheme boundary is always after, and never be- 
fore, the interfix. 

Verb prefixation and particle verb formation creates new word-inter- 
nal consonant clusters as well. For example, the separable particle ab- 
motivates the exclusively morphonotactic clusters /p-d, p-t, p-g, p-k, p-f, 
p-ts, p-v/, as in ab-drehen ‘turn off’, ab-geben ‘give in’, ab-kommen ‘get 
away’, ab-treten ‘wear out’, ab-schaffen ‘abolish’, ab-wickeln ‘unwind’, 
ab-ziehen ‘remove’, (with the addition of longer clusters, as in ab-streiten 
‘deny’). Moreover, some of the few non-separable verbal prefixes create 
new clusters, as with ent-, and the earlier but now only vestigial affix 
ant- as in Ant-wort ‘answer’; in the parallel formation Antlitz ‘face’ the 
morpheme boundary was lost, and the cluster became a phonotactic one. 
A morpheme boundary must also be assumed after cranberry morphs, as 
in Sint-flut ‘deluge’, cf. Flut ‘flood’. 

In contrast to many non-Germanic Indo-European languages, German 
affixation does not provoke internal vowel deletion and internal morpho- 
notactic clusters caused by it, other than of the weakest vowel schwa. An 
exception is Risiko ‘risk’ — adj. risk-ant. An epenthetic schwa is lost be- 
fore a (originally word-final) sonorant in derivation, such as in the derived 
adjectives adl-ig ‘noble’, silbr-ig ‘silvery’ (more examples in Meinhold & 
Stock 1980: 197-201). Inflectional affixation results even more rarely in 
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subtraction, which creates morphonotactic clusters, such as in Risk-en, the 
plural of Risiko (in contrast to the much greater frequency in Slavic lan- 
guages, Latin, Greek and other ancient Indo-European languages). 

In addition, word formation creates geminate consonants which are 
disallowed morpheme-internally, and phonotactically, with even more 
marked results; pseudogeminates are created by syllable- and morpheme- 
final obstruent devoicing, as in ab-bauen ‘dismantle’ with /p, b/. 

Among clusters which are both phonotactic and morphonotactic, the 
productive word formation devices of compounding, verbal prefixation 
and particle verb formation may greatly outweigh the proportion of pho- 
notactic clusters in types and tokens, e.g. for clusters starting with /-st-/, 
as in west+romisch ‘Western Roman’ and aus-treiben ‘drive out’ as op- 
posed to phonotactic cases in loan words, such as Pastrami. This may 
create problems for matching phonotactic and morphonotactic clusters in 
psycholinguistic tests. 

Only the complexity of consonant clusters, at least in terms of the 
number of member consonants and of the creation of new clusters which 
are not allowed in phonotactics, rises due to morphological operations. 
And in this sense, morphonotactic clusters are, on average, more marked 
than phonotactic clusters. 


4. CONCLUSIONS 


4.1. GENERAL RESULTS 


The claim that in general morphonotactic clusters are more dispre- 
ferred than phonotactic clusters (Dressler & Dziubalska-Kotaczyk 2006: 
83, Zydorowicz et al. 2016: 19-20) has been disproven for German pe- 
ripheral triple consonant clusters. This removes an apparent contradiction 
between the claim and external psycholinguistic evidence from acquisi- 
tion and processing experiments. In the first language acquisition of at 
least the richly inflecting languages Polish and Lithuanian, morphono- 
tactic clusters are acquired earlier than phonotactic clusters (Zydorowicz 
2010, Kamandulyté-Merfeldiené 2015). And at least in certain psycho- 
linguistic experiments (cf. the other contributions to this volume), mor- 
phonotactic clusters are processed more quickly than phonotactic ones. 
Therefore, the claim that morphonotactic clusters are more dispreferred 
than phonotactic clusters should be dropped. 

This conclusion is also supported by the ease of diachronic introduc- 
tion of new, i.e. morphonotactic clusters into languages that lacked them. 
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A further finding on diachrony is that we have found in German, in anal- 
ogy to what has been found in other languages, examples of the lexical 
development of morphonotactic clusters into phonotactic ones because 
of morphosemantic opacity leading to the loss of morpheme boundaries, 
as in Brunst ‘ardour, lust’ no longer being related to its former verb base 
brenn-en ‘burn’, except metalinguistically (cf. Dressler et al. 2019) 

Similarly to many other languages, quadruple clusters can be reduced 
in casual speech. Thus, the normal pronunciation of 2°! SG. wdsch-st 
‘(you) wash’ is [veft]. These instances are fairly regular if the NAD dis- 
tance is minimal, as in this case. 

Probably, segmentally identical phonotactic and morphonotac- 
tic clusters have different vowel durations (cf. Plag 2014; Zimmerer, 
Scharinger & Reetz 2014), but it is, as yet, unclear whether these differ- 
ences lie above the threshold of perceptibility. Moreover, other studies 
contradict these findings (see the discussion in Leykum & Moosmiiller, 
this volume). In any event, Plag is right in objecting to linguistic mod- 
els which crucially contain a flow-chart from one submodule to another 
in a way which presupposes bracket erasure (also criticized in Brown 
& Hippisley 2012: 273). Our model of morphonotactics (Dressler & 
Dziubalska-Kotaczyk 2006; Dressler et al. 2010; Korecky-Kroll et al. 
2014) does not presuppose such bracket erasure. This also fits Slovak 
word-medial patterns: assuming that in a flow-chart, inflectional mor- 
phology follows derivational morphology, the derivational boundary in 
potok ‘stream’ must not be erased in order to prevent vowel deletion in 
Gen.SG. po-tok-a/u, in contrast to the deletion of the second vowel in the 
oblique cases of ist-ok ‘source’ and otec ‘father’ (Dressler et al. 2015). 

For results regarding NAD calculations, see section 2. 


4.2. TYPOLOGICAL CONCLUSIONS 


Phonotactic asymmetries between word-initial, word-final and word- 
medial positions are well known. This starts with how the universal pref- 
erence for CV structures (Dziubalska-Kolaczyk 2002, 2009) is realized 
in the three positions and depending on whether a word is monosyllabic, 
disyllabic or polysyllabic. 

What is interesting for the typological characterization of German is 
the much greater variety and complexity of word-final than of word-initial 
clusters, e.g. in contrast to Slavic languages, Latin, Greek and other Indo- 
European languages. This asymmetry is also reflected in greater type and 
token frequencies for word-final than for word-initial obstruent clusters. 
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Type frequency asymmetries proved to be radicalized in token frequency 
differences, which means that the dominant patterns are more profitable. 

This asymmetry has two sources: on the one hand, we have the dia- 
chronic result of prehistoric or early historic major vowel deletions in 
German word-final positions as opposed to the optimal preservation of 
vowels in word-initial positions. Those lost vowels of word-final sylla- 
bles were all unstressed, which was not the case for word-initial syllables. 
On the other hand, we have the more important consequence of German 
having many short derivational and inflectional suffixes which are mono- 
consonantal or biconsonantal. But due to the restriction of morphologi- 
cal consonantism to very few consonants, already identified by Jakobson 
(1962: 108) for Indo-European languages, in German we find only final 
morphonotactic clusters ending in -¢, -s, -st, -ts. Therefore, it seems a 
paradox that we find a still more radical restriction for final phonotactic 
clusters, namely to -¢, -st and to nouns. The reason is again diachronic: all 
the final phonotactic nominal triple clusters go back or seem to go back to 
morphonotactic clusters with a final suffix now ending in -¢ due to the loss 
of unstressed vowels that followed them or a -t added secondarily in early 
New High German as a phonological addition, as in Werft ‘shipyard’, Axt 
‘axe’, Obst ‘fruit’, sonst ‘otherwise’, dialectal Senft ‘mustard’ (Kluge & 
Götze 1957 sub vocibus). 

Word-internally, the contrast between exclusively morphonotactic and 
exclusively phonotactic triconsonantal clusters seems to be even bigger. 
Also, here most triconsonantal clusters with two obstruents are only mor- 
phonotactic. An among ambiguous consonant clusters, the frequencies 
of morphonotactic clusters seem to be higher than those of phonotactic 
clusters. For efficient calculation of these frequency relations, new text- 
technological tools must be developed. 

The fact that in German peripheral positions the NAD preferences for 
consonant clusters are identical irrespective of whether the most periph- 
eral consonant is included or excluded in the NAD calculations, seems to 
be specific for Germanic languages. When we checked peripheral conso- 
nant clusters in Polish and English according to the list of clusters in Zy- 
dorowicz et al. (2016), we found that the (dis)preferredness of consonant 
clusters is different in Polish depending on whether the most peripheral 
consonants are included or excluded, but not in English. 

Polish and at least Slovak among other Slavic languages (Dressler et 
al. 2015) differ from German and English with regard to peripheral triple 
consonant clusters in the following features, which appear to be relevant 
for the impact of the most peripheral consonant on cluster preferences 
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when they are added to the more interior double consonant clusters: 

First of all, the two Slavic languages are consonantal languages to a 
higher extent than the two Germanic languages. They have a much higher 
number of different triple consonant clusters than the two Germanic lan- 
guages. For example, Polish has more than a hundred word-initial triple 
clusters, German only eight. 

Second, Polish has many more word-initial triple morphonotactic 
clusters in tokens than phonotactic clusters; the two Germanic languages 
have no word-initial morphonotactic clusters. 

Third, for word-final triple consonant clusters, the two Germanic lan- 
guages have many more morphonotactic than phonotactic clusters, all of 
them due to the morphological operation of suffixation (1.e. addition). 
Polish and Slovak have only word-final morphonotactic clusters created 
through the subtractive morphological operation of deletion of the word- 
final stem vowel in the genitive plural, e.g. in Pol. zemst vs. Nom.SG. 
zemsta ‘revenge’, Slov. pomst vs. Nom.SG. pomsta ‘revenge’. In addi- 
tion, Polish and other Slavic languages also create word-initial and word- 
medial consonant clusters due to vowel deletion in inflection and deri- 
vation, as in Pol. Gen.SG. ps-a from pies ‘dog’. German has only rare 
word-medial cases (see section 3). 

Fourth, the most peripheral German consonants in triple consonant 
clusters in a word-initial position are only /s/ and /{/ (in English only 
/s/), whereas Polish and Slovak also have many other consonants in this 
position. In word-final position the most peripheral consonants in Ger- 
man are only /t, s, ts/, in English /t, d, s, z/. These consonants are also the 
preferred final consonants in double clusters. By contrast, many different 
final consonants occur in Polish and Slovak word-final clusters. Thus, it 
seems that in the case of strong restrictions on the selection of the most 
peripheral consonants, the selection is natural, in the sense of not chang- 
ing the (dis)preferredness of the interior consonant clusters to which they 
are added. This is reminiscent of those phonotactic analyses which as- 
sume for German, as for many other languages, that any third consonant 
in a tautosyllabic consonant cluster is extrasyllabic or extrametrical (see 
Wiese 1988, 2000). 

This may also explain why, in the diachronic development of German, 
/t/ was sometimes added to a word-final consonant, as in Axt ‘axe’, Palast 
‘palace’, Obst ‘fruit’ from MHG obes, Sekt ‘sparkling wine’ from Fr. vin 
sec, dialectal Austrian German Senft — Senf ‘mustard’. 
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4.3. CONSIDERATIONS ON WORKING WITH LARGE ELECTRONIC CORPORA 


Working with large electronic corpora allows us to arrive at more re- 
liable quantitative results. Here, the type-token ratio is very low for all 
triple clusters. For quadruple clusters we found (see section 2.1) distinct 
groupings within the whole range from 0.01% to 100%. Thus, the nu- 
merically most complex clusters behave differently than the less complex 
and more numerous triple clusters. The largest subgroup of quadruple 
clusters has a similar TTR distribution to the triple ones and contains the 
only four clusters which also include a small phonotactic minority. The 
more numerous groups of quadruple clusters are only morphonotactic: 
this again indicates the marked character of complex consonant clusters. 

Our corpus-based study relied on the huge electronic corpus AMC, 
which may be the most complete print media corpus for any nation. This 
enhanced reliability for quantitative generalizations about the distribution 
of morphological and lexical patterns of consonant clusters. The disad- 
vantage that such big corpora include many erroneous types of words was 
at least partially corrected for by manual exclusion of errors and by the re- 
striction to types which have at least 5 tokens in the corpus. We included 
clusters with fewer than 5 tokens only if the cluster would otherwise not 
have been represented in our description. In discussions with other native 
speakers of German we could not think of any potential morphonotactic 
cluster which does not occur in the AMC. 

Clearly new automatic tools should be developed for reducing the 
error-prone nature of large electronic corpora. More efficient tools are 
also needed for pattern searches, as we ascertained when studying word- 
internal clusters. 

Even with better tools, the evidence from such an electronic corpus of 
written adult and adult-directed speech must be considered with caution. 
The AMC represents just one genre, and it has been found, at least for 
Modern Greek and Balto-Slavic languages (Dressler et al. 2017) that the 
distribution of lexical and morphological patterns may differ significantly 
for different genres. 
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1. INTRODUCTION 


The interaction between morphology and phonetics is an area for 
which a lot of research is still needed (see e.g. Kawahara 2011). Some 
findings favour the view that morphology does not influence speech pro- 
duction, while others indicate that an interaction between morphology 
and phonetics exists, i.e. there is an impact of morphology on the pho- 
netic realization of speech. 

One way to investigate this topic is to compare consonant combina- 
tions across word-internal morpheme boundaries (morphonotactic con- 
sonant clusters, e.g. /xt/ in German /max+t/ macht ‘(s/he) makes’), with 
consonant combinations within a single morpheme (phonotactic conso- 
nant clusters, e.g. /xt/ in German /maxt/ Macht ‘power’). Some consonant 
combinations only exist across morpheme boundaries (purely morpho- 
notactic clusters, e.g. /xst/ in German /max+st/ machst ‘(you) make’), 
whilst others exist nearly only within morphemes (predominantly pho- 
notactic clusters, e.g. /mpf/ in German /{tsumpf/ Strumpf ‘sock’). There 
are, however, several consonant combinations which occur both within 
morphemes as well as across morpheme boundaries ((mor)phonotactic 
clusters); these have been studied in the present paper. For purely mor- 
phonotactic and predominantly phonotactic clusters, the cluster itself can 
mark the presence or absence of a morpheme boundary. However, (mor) 
phonotactic clusters have no boundary-signalling function. Hence, the 
question arises of whether morpheme boundaries within consonant clus- 
ters are marked phonetically. In order to investigate this question, the 
present study analyses (mor)phonotactic consonant clusters in homoph- 
onous word pairs, in word pairs of the same grammatical category, in 
different positions within the target words (word-final and word-medial 
clusters) and in languages/varieties with different typological classifica- 
tions (word language, mixed-type language and quantifying language). 


' Acoustics Research Institute, Austrian Academy of Sciences, Vienna. 
? Subsets of the material analysed in this paper have already been analysed for con- 
ference contributions and proceedings (Leykum, Moosmiiller & Dressler 2015a; 
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2. STATE OF RESEARCH 


2.1. INFLUENCE OF MORPHOLOGY ON SPEECH PRODUCTION 


The few studies reporting an impact of morpheme boundaries on the 
phonetic realization of spoken language show diverging results. Some 
studies indicate an impact by a morpheme boundary on speech produc- 
tion: several studies (Neu 1980; Guy 1991; Guy 1996; Guy, Hay & Walk- 
er 2008; Myers 1995) on word-final /t, d/-deletion in American English 
(AE) and New Zealand English revealed that there are fewer coronal 
stop deletions when /d/ represents the regular past ending of conjugated 
verbs. Equally, for Standard Dutch, Schuppler et al. (2012) found fewer 
deletions of word-final /t/ when it constitutes a morpheme. Concerning 
word-final /s, z/ in AE, Seyfarth (2016) spotted longer durations for the 
stem and suffix of inflected verbs compared to the equivalent duration- 
al measurements for uninflected homophonous words (Pluymaekers et 
al. 2010). The above-mentioned findings, namely fewer reductions and 
fewer deletions across morpheme boundaries, can be explained by the 
importance of highlighting the morpheme boundary in order to enhance 
the comprehensibility. Other studies, however, reported an influence of 
morphology where the direction of the effect is opposed to the afore- 
mentioned findings: Plag (2014) reported shorter durations of word-fi- 
nal /s/ following a morpheme boundary for Dutch. Pluymaekers et al. 
(2010) found an influence of morphology on the phonetic realization of 
the Dutch suffix —igheid (/oxheit/): the cluster /xh/ is realized with a lon- 
ger duration when it consists of only one morpheme; it is realized with a 
shorter duration when the suffix is bimorphemic (the authors explain this 
result by the Morphological Informativeness Hypothesis). 

Contrary to these findings, other studies revealed no effect of mor- 
pheme boundaries on consonant realizations: Zimmerer, Scharinger and 
Reetz (2011, 2014) showed a large influence of the phonological context 
on the realization of word-final /t/ in German, but no influence of the 
morphological status of /t/. Equally, a study investigating realizations and 


Leykum, Moosmiiller & Dressler 2015b; Leykum & Moosmiiller 2015; Leykum 
& Moosmiiller 2016; Leykum & Moosmiiller 2017; Leykum & Moosmiiller 2018; 
Leykum & Moosmiiller 2019). References concerning the corresponding papers or 
abstracts will be given at relevant points. However, in the present paper, a broad and 
detailed analysis of phonotactic and morphonotactic consonant clusters in speech 
production is conducted, going far beyond a summary of previous studies on subsets 
of the speech material. 
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deletions of word-final /t, d/ in British English (BE) found a high impact 
of the surrounding phonemes on the realizations or deletions of /t, d/, but 
no influence due to morphology (Tagliamonte & Temple 2005). Seyfarth 
(2016) investigated AE homophones and found, for stimuli ending in [t, 
d], no influence of a morpheme boundary prior to the final stop on stem 
duration or suffix duration. 

Some studies investigated articulatory processes during the realiza- 
tion of speech segments across morpheme boundaries. Cho (2001) inves- 
tigated intergestural timing across morpheme boundaries in Korean by 
means of electromagnetic articulography (EMA) and electropalatography 
(EPG). He revealed that articulation is more stable in monomorphemes 
and more variable across word-internal morpheme boundaries (in non- 
lexicalized compounds) as well as across word boundaries. However, by 
using combined acoustic-articulatory investigation methods (EMA, EPG, 
laryngography), Nakamura (2015) detected only an influence of the pho- 
nological context, but no impact of morphology on the realization or dele- 
tion of word-final coronal stops in British English. 


2.2. AIM OF THE STUDY AND HYPOTHESIS 


Until now, acoustic investigations concerning the influence of mor- 
pheme boundaries on consonant realizations have been limited to du- 
rational measurements in a few languages. The present study not only 
investigates two languages in which the phonotactic-morphonotactic dis- 
tinction of consonant clusters has not yet been investigated (apart from 
our own studies) but also adds the investigation of intensity measure- 
ments to the analyses of durational measurements. In addition, contrary 
to most of the aforementioned studies, which analysed single consonants 
following morpheme boundaries, our study focuses on phonologically 
homophonous (mor)phonotactic consonant clusters. 

Apart from speech production, other research areas have studied pho- 
notactic and morphonotactic consonant clusters. The processing of mor- 
phonotactic clusters is assumed to be facilitated by the morphological 
function of the consonant clusters (Korecky-Kroll et al. 2014; Celata et 
al. 2015). In computer simulations, different cognitive representations for 
the two types of clusters have been revealed (Calderone et al. 2014). Con- 
cerning first language acquisition, the findings are mixed. Some studies 
found that children learn to produce morphonotactic consonant clusters 
earlier compared to phonotactic consonant clusters (Kamandulyté 2006; 
Zydorowicz 2007), while others concluded that children learn both types 


56 H. Leykum, S. Moosmiiller 


of clusters at the same time (Freiberger 2007). The aforementioned in- 
vestigations point out that in speech processing, computer simulations, 
and language acquisition, differences between the two types of clusters 
could exist. Therefore, as an extension of the Strong Morphonotactic Hy- 
pothesis (Dressler & Dziubalska-Koltaczyk 2006), which is restricted to 
an interaction between morphology and phonology (not phonetics), the 
hypothesis of the present study predicts that these differences also exist 
in speech production, even though the rare findings on speech production 
are mixed. The hypothesis is as follows: 

Consonant clusters across word-internal morpheme boundaries (mor- 
phonotactic clusters) are expected to be more robust and more highlight- 
ed in speech production than consonant clusters within a morpheme (pho- 
notactic clusters). 

Since language-specific differences are possible, three different lan- 
guage types are compared in the present study: a word language (Standard 
German German (SGG)), a mixed-type language (Standard Austrian Ger- 
man (SAG)) and a quantifying language (Standard French (FR)). These 
three types were chosen to investigate whether language-type-specific 
timing characteristics have an influence on the highlighting/reduction of 
consonant clusters. In quantifying languages, a distinction between ho- 
mophonous phonotactic and morphonotactic clusters may disturb the tem- 
poral pattern of the language (Moosmiiller & Brandstatter 2014). Thus, 
reductions of phonotactic clusters and/or lengthening of phonemes in mor- 
phonotactic clusters are expected to be less probable in quantifying lan- 
guages. Therefore, with regard to the language type, it is hypothesized that 
durational differences between phonotactic and morphonotactic clusters 
will be more pronounced in SGG as compared to SAG, and the differences 
are expected to be greater for both varieties of German than those in FR. 


2.3. MATERIAL AND GENERAL METHODS 


Stimuli 


Comparisons of the acoustic characteristics of consonant clusters 
within morphemes and across word-internal morpheme boundaries are 
only conclusive when the clusters are phonologically homophonous. 
Therefore, for the present study, several (mor)phonotactic consonant 
clusters were chosen which occur in the same position within words, once 
as a phonotactic cluster, and once as a morphonotactic cluster (emerging 
from productive word-formation rules). Since morphonotactic consonant 
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clusters in the word-initial position are not possible in German, only con- 
sonant clusters in a word-medial and word-final position were considered 
for the present investigation. 

The target words were nouns, verbs, and adjectives with a (mor)pho- 
notactic consonant cluster in a word-final or word-medial position. Within 
each word pair, the phonemes preceding and (for word-medial clusters) 
following the consonant cluster were kept as constant as possible to mini- 
mize the influence of the phonological context on the realization of the 
consonant cluster. Therefore, the target words with word-final consonant 
clusters were pairs of homophonous words, which raises the problem that 
we have to compare nouns and conjugated verbs. For the target words with 
word-medial consonant clusters, word pairs belonging to the same gram- 
matical category were chosen. The target words are listed in Table 1. Since 
the word pairs were not matched for word frequency, this variable was 
controlled for statistically. Word frequency values were extracted from 
http://wortschatz.uni-leipzig.de (Quasthoff, Goldhahn & Heyer 2013). 


Participants 


Recordings of 16 speakers of Standard Austrian German (SAG) were 
made. All these SAG speakers were, as defined by Moosmüller (1991), 
students (younger age group) or university graduates (younger and older 
age group) who were born and raised in Vienna, with at least one parent 
fulfilling the same criteria. The speakers can be assigned to two equal age 
groups: the younger speakers were between 18 and 25 years old; the older 
speakers were 45—60 years old. In both age groups, the speakers were 
balanced for gender. 

Additionally, recordings of six younger speakers (18—25 years) of Stan- 
dard French (FR) and eight speakers (18—25 years) of Standard German 
German (SGG) were conducted. In both groups, the speakers were bal- 
anced for gender. The speakers of FR were students or university gradu- 
ates originating from the region Ile-de-France; all speakers of SGG were 
born and raised in the northern part of Germany (north of the Benrath line). 
For all participants, the same criteria were fulfilled by at least one parent. 


Recordings 


The recordings were conducted in a semi-anechoic sound booth (IAC- 
1202A). In the recording session, after a semi-structured interview, the 
participants undertook several reading tasks. For one reading task, the 
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target words were embedded in carrier phrases in a post-focal position. 
For this, the participants were told that they had to correct a misunder- 
standing concerning the addressee of an utterance. In the sentences, the 
pronoun or name was printed in bold, and the participants were asked to 
stress the pronoun/name when reading the sentences. This type of carrier 
phrase and the corresponding instructions were chosen to avoid stress on 
the target word, to enable phonetic reduction processes. The target word 
was always followed by the word gesagt ‘said’ to control the follow- 
ing phonological context for words with a word-final consonant cluster. 
The sentence finished with glaube ich ‘I think’ to avoid a sentence-final 
lengthening starting already in the target word. The following sentences 
are two examples of sentences for the first reading task: 

Zu thr? - Ich habe zu ihm „die Hast“ gesagt, glaube ich. 

‘To her? - I said to him “the hurry”, I think.’ 

Zu mir? - Ich habe zu Peter ,,er macht“ gesagt, glaube ich. 

‘To me? - I said to Peter “he makes”, I think.’ 

In a second speaking task, semi-spontaneous speech was elicited. In this 
task, the speakers had to read a given question (in which the target word 
was already mentioned) and answer the question by including two given 
words in their answer. The first given word was the target word, and the 
second word was given to draw attention away from the target word and to 
facilitate the task. Only SAG and FR speakers performed the semi-sponta- 
neous task. Here are two examples of the semi-spontaneous speaking task: 


Hast, Schlüssel Was vergisst dein Nachbar oft in der Hast mit- 
zunehmen? 

‘hurry, keys’ “What does your neighbour often forget when he 
is in a hurry?’ 

hasst, Katzen Hasst Herr Müller Hunde? 

‘hates, cats’ ‘Does Mr. Müller hate dogs?’ 


Possible answers by the participants for the first question were: In der 
Hast vergisst er seine Schlüssel ‘When he is in a hurry, he forgets his 
keys’, and for the second example: Nein, Herr Miiller hasst Katzen ‘No, 
Mr. Müller hates cats’. 

Additionally, some of the target words with word-final consonant 
clusters were embedded in more natural sentences. In this, for the target 
words, which were verbs, the subject pronoun and the verb were sepa- 
rated to reduce the redundant coding of the morpheme boundary. In ad- 
dition, the target word was always followed by a word starting with /g/, 


II. Morphonotactics in speech production 59 


to reduce the impact of the phonological context. These sentences were 
only read by the speakers of SAG. Two examples of this second reading 
task are given below: 


Die Zeit misst gleich in der nächsten Runde Matthias. 
‘The time will be measured in the next round by Matthias.’ 


Ihr Freund hat gesagt, dass er sie nicht wirklich hasst, glaube ich. 
‘Her friend said that he does not really hate her, I think.’ 


For both reading tasks, the sentences were put in random order and read 
by the participants twice within the larger recording session. The semi- 
spontaneous speaking task was conducted only once. After subtracting 
a few mispronounced and misread items, this resulted in a total of 2,402 
analysable target words (SAG + SGG + FR; word-medial + word-final). 

In order to conduct the acoustic analyses, the recordings were manu- 
ally segmented and annotated with STx (Noll et al. 2007) on a sentence, 
word and phoneme level. The duration and intensity values of the fol- 
lowing segments were measured and semi-automatically extracted: target 
words, surrounding words, consonant clusters, individual consonants of 
the clusters, and phonemes surrounding the clusters. 

The data was statistically analysed with R (R Core Team 2015) by us- 
ing mixed-effects models (Bates et al. 2015). The variables subject and 
word were included in the models as random factors. Additionally, the fol- 
lowing control variables were included in the models whenever they had 
an effect on the dependent variables: word frequency, articulation rate, /t/- 
deletions, stress on the target word, and pauses following the target word. 

The mixed-effects models were fitted using a forward approach: ef- 
fects were added one by one. Based on the p-value, a decision was made 
on whether to keep the variable or interaction in the model or to exclude it 
(threshold: p = 0.1). Where necessary, Tukey post-hoc tests with p-value 
adjustment were carried out. 

To normalize the data, two different methods were used: on the one 
hand, the total duration (or mean syllable duration for word-medial clus- 
ters) or intensity of the target word were included in the statistical analy- 
ses to control statistically for any impact of speaker-specific differences. 
On the other hand, the relative duration of each cluster or consonant was 
calculated by dividing the segment duration by the word duration, cluster 
duration or mean syllable duration (for word-medial clusters). To calcu- 
late the relative intensity, the intensity of the segment was divided by the 
mean word intensity or cluster intensity. The normalization method used 
for each analysis is indicated in the following section. 


H. Leykum, S. Moosmiiller 


60 


quausadxgq jyundxı 4 juəuniədxq jyundxı 4 


Jəipdxə asadxy AaUupAvaxy asadxy AaUupADaX 
anbayjoosip 


(a)nuyuoosip yaysoysiq | yryj4ainuyuoysip YOYIOYsi | YPp4ainuyjuoysip 


yosnunysı yousnDy yosmuDys! yousnpy 
© əsmpuvjsı| —uoyvoojsip| yosipupjst| yonsa yosipupjst| 4211501 


SNUSIUDSIC) INUSSIJN SNUSIUDSAQ INUSSIJN 


ƏNDIWUSOI| UOISSIUSUDA]J yəsyəwusoy | B1/DWUWSIU]]DYsIA YISIAUSOY | SışfpusıupyAa 
WAN]YUNT 
ayssny 


aisspd 


ADJYIDAT 


91198} 31798} 
oogjzouoyd : onovjouo"ud IPegzouoydirow oogjzouoyd ayogjouoydiow | onoezouoyd : 


yqousty DDS Terpow-pIom OVS feug-pIoľm OVS 


Jgysnio 


(p IJQEL pUe Z IJqQeL Ul poprAold ore suonejsuen) SPIOM JOSIVL “| IALL 


II. Morphonotactics in speech production 61 


3. ACOUSTIC ANALYSES 


3.1. WORD-FINAL CLUSTERS IN SAG AND SGG 


First, word-final consonant clusters in homophonous word pairs real- 
ized by speakers of SAG (16 speakers) and SGG (8 young speakers) were 
compared (see also Leykum & Moosmiiller 2015, Leykum et al. 2015a, 
Leykum & Moosmiiller 2016). The target words were realized by all 
speakers twice within the carrier phrases. Moreover, the speakers of SAG 
conducted two additional tasks: they read sentences in which the sub- 
ject pronoun and verb were separated for the bimorphemic target words 
(twice), and they realized the target word once in the semi-spontaneous 
speaking task. 

The investigated target words were the following: 


Table 2. Target words with word-final consonant cluster? 


Cluster | phonotactic morphonotactic | phonotactic morphonotactic 
/ft/ Schaft schafft Schaft schafft 

‘stem’ ‘(s/he) creates’ ‘stem’ ‘(s/he) creates’ 
/xt/ Macht macht Macht macht 

‘power’ ‘(s/he) makes’ ‘power’ ‘(s/he) makes’ 
/pst/ Propst probst 


‘provost’ ‘(you) rehearse’ 


/nkst/ Hengst hdngst Hengst hdngst 
‘stallion’ ‘(you) hang’ ‘stallion’ ‘(you) hang’ 
/nst/ Dienst dienst 
‘service’ ‘(you) serve’ 
ht/ /nd/ | Rind rinnt 
‘beef, cow’ ‘(it) flows’ 


/st/ Mist misst 
‘dung, rubbish’ ‘(s/he) measures’ | ‘dung, rubbish’ ‘(s/he) measures’ 
Hast hasst Hast hasst 
‘hurry’ ‘(s/he) hates’ ‘hurry’ ‘(s/he) hates’ 


3 Even though the orthography differs, for all word pairs, the item with a phonotactic 
cluster and its counterpart with a morphonotactic cluster are phonemically homopho- 
nous. 
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Results 


/t/-deletion 


In the word-final position, /t/ was acoustically deleted in several cas- 
es. In total, /t/-deletions occurred in 11.18% of the phonotactic clusters, 
whereas in morphonotactic clusters, 13.64% of word-final /t/ were acous- 
tically deleted. The deletion rates did not significantly differ between the 
two types of clusters (z = -0.877, p = 0.381). Since the deletion rate is 
highly influenced by the phonological context (z = 3.777, p < 0.001), only 
the /t/s followed by /g/ were regarded in the next step. Out of these clus- 
ters, 11.16% of the phonotactic clusters were realized without the /t/, and 
10.38% of the morphonotactic clusters (here again, there is no significant 
difference between the two types of clusters: z = -0.220, p = 0.826). 

Concerning the segmental context, the deletion rate of /t/ was highest 
when the preceding phoneme was the homorganic fricative /s/ as com- 
pared to the other preceding contexts (z = -4.139, p < 0.001; /t/-deletions 
following /s/: 16.09% in phonotactic clusters, 16.23% in morphonotactic 
clusters; /t/-deletions following other phonemes: 3.36% in phonotactic 
clusters, 7.66% in morphonotactic clusters, see Figure 1). 


/t/-realizations and /t/-deletions 


ce ee 


realized /t/ m/t/-deletion 


Figure 1. Percentages of /t/-realizations and /t/-deletions 


Relative duration of the cluster 


The fitted mixed-effects models revealed the following significant ef- 
fects for the relative duration of the entire cluster (in % of word dura- 
tion): a type-of-cluster*speaking-task interaction (F(2,1383) = 20.800, 
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p < 0.001), a type-of-cluster*/t/-realization interaction (#(1398) = 3.210, 
p= 09.001), a gender*variety/age interaction (F(2,18) = 3.940, p = 0.037), 
a main effect of articulation rate (#(1395) = -10.670, p < 0.001), and a main 
effect of the cluster (F(6,7) = 47.035, p <0.001). Post-hoc analyses showed 
a significant type-of-cluster difference for the type-of-cluster*speaking- 
task interaction in the additional speaking task only, where subject pro- 
noun and conjugated verb were separated. Here, the phonotactic clusters 
were shorter compared to the morphonotactic clusters (t(24) = 3.629, 
p=0.015). The type-of-cluster*/t/-realization interaction revealed shorter 
durations for both types of clusters when the word-final /t/ was deleted. 
This effect was slightly larger for morphonotactic clusters (phonotactic: 
t(1408) = 6.723, p < 0.001; morphonotactic: 41411) = 11.539, p < 0.001, 
see Figure 2). A closer look at the gender*variety/age interaction revealed 
that the clusters of the elder female SAG-speakers were shorter compared 
to all other groups of speakers (see Table 3). 


Table 3. Gender*variety/age-interaction (post-hoc tests) 


contrast df t-value | p-value | signifi- 
cance 
SAG: elder, female — SAG: elder, male 16.89 -3.752 | 0.0168 = 


SAG: elder, female — SAG: young, female 18.11 -2.950 | 0.0776 f 

SAG: elder, female — SAG: young, male i 
SAG: elder, female — SGG: young, female ae 
SAG: elder, female — SGG: young, male Ft 
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Figure 2. Interaction type-of-cluster*/t/-realization 
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Relative duration of /t/ 


Concerning the duration of /t/ in relation to the duration of the entire 
word (% of word duration), the statistical analyses revealed a significant 
three-way interaction between the task, the word frequency and the type 
of cluster (F(2,1194) = 5.291, p = 0.005), and main effects of articula- 
tion rate (4/1114) = 2.641, p = 0.008), variety/age (tendency: F(2,20) = 
2.909, p = 0.077; elder SAG < younger SAG < SGG speakers), gender 
(tendency: #(19) = 1.994, p = 0.061; female < male speakers), and cluster 
(F(6,7) = 17.111, p < 0.001). Post-hoc analyses showed that only for the 
speaking task with separated pronoun and verb was it the case that the 
higher the word frequency, the more the two types of clusters differed in 
their length; with /t/ being relatively longer in morphonotactic clusters 
(see Figure 3). 


Relative duration of /t/ 


Reading task 1: carrier phrases Semi-spontaneous speaking task Reading task 2: separated subject pronoun and verb 
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Figure 3. Three-way interaction: task*word-frequency*type-of-cluster 


The effect of the articulation rate (longer relative duration of /t/ for 
higher articulation rates) emerged due to an articulation rate-induced 
shortening of the entire word (especially the vowel: main effect of articu- 
lation rate (¢(1230) = -8.621, p < 0.001). 


Relative intensity of the clusters 


The fitted mixed-effects model showed that the relative intensity of 
the clusters (in % of word intensity) is significantly influenced by an 
interaction between the task and /t/-realization (F(2,1406) = 4.114, p = 
0.016), a word-frequency*articulation-rate interaction (#(1397) = -2.780, 
p = 9.005), a main effect of gender (t(22) = 3.562, p = 0.002), and a main 
effect of cluster (F(6,9) = 130, p < 0.001). No influence of the type of 
cluster was found (p = 0.804). A post-hoc test concerning the task*/t/- 
realization interaction revealed significantly lower relative cluster in- 
tensities of clusters with realized final /t/ compared to the clusters with 
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/t/-deletion for both reading tasks (carrier phrases: (1409) = -3.215, p = 
0.017; second reading task: 7(1401) = -3.659, p = 0.004), but not for the 
semi-spontaneous speaking task (#(1402) = -0.237, p = 0.999). 


Relative intensity of /t/ 


The relative intensity of /t/ (in % of word intensity) 1s influenced by an 
interaction between the type of cluster and the task (F(2,1122) = 6.657, 
p = 9.001), an interaction of task and gender (F(2,1204) = 9.462, p < 
0.001), a main effect of articulation rate (#11213) = 2.989, p = 0.003), and 
a main effect of cluster (F(6,12) = 13.959, p < 0.001). Post-hoc analy- 
ses revealed that in none of the speaking tasks did phonotactic and mor- 
phonotactic clusters differ in their relative intensity of /t/. However, the 
relative intensity of /t/ was significantly lower in the speaking task with 
a separated subject pronoun and verb as compared to /t/ in target words 
embedded in the carrier phrases. This effect was slightly larger for the 
phonotactic clusters (morphonotactic clusters: (1204) = 4.058, p < 0.001; 
phonotactic clusters: (1147) = 7.607, p < 0.001, see Figure 4). 
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Figure 4. Interaction type-of-cluster*task 
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Discussion of word-final consonant clusters 


Regarding the number of acoustic deletions of /t/ and the relative in- 
tensity of the cluster, no significant difference between phonotactic and 
morphonotactic clusters exists. For the other investigated variables, in- 
teractions including an effect of the type of cluster reached significance. 
However, no main effects of the type of cluster were found. The interac- 
tions were, with one exception, all interactions with the speaking task. 
The additional speaking task was designed to test whether the redundant 
coding of the information given by the conjugational morpheme reduces 
the importance of a highlighting of the morpheme boundary, which could 
result in less highlighting of morphonotactic clusters. Therefore, oppos- 
ing effects could explain the lack of a difference between the phonetic re- 
alization of phonotactic and morphonotactic clusters in the other speaking 
tasks. However, the effects of the present study could not be interpreted as 
evidence for this hypothesis, since the additional speaking task involved a 
highly unnatural wording for some of the sentences, which in itself results 
in a higher articulation accuracy. The target words containing phonotactic 
clusters were also embedded in the sentences. For the nouns, however, 
the context was more natural, possibly resulting in a less accurate articu- 
lation. In addition, some of the target words with a morphonotactic cluster 
were in a phrase-final position, resulting in phrase-final lengthening of 
the target word. 

With regard to the relative duration of the cluster, the type-of- 
cluster*/t/-realization interaction showing a slightly larger difference 
between clusters with and without final /t/ for morphonotactic clusters 
compared to phonotactic clusters seems to be a random result, which pos- 
sibly emerged due to differences between the clusters themselves and the 
low number of clusters with /t/-deletion. 

The effects of the type of cluster emerging in the analyses can easily 
be explained by the unnatural wording, and by differences in the posi- 
tions of the target words within the sentences in the second reading task. 
However, when investigating word-final consonant clusters in German 
homophones, the lack of an effect of a cluster-internal morpheme bound- 
ary on speech production could not be interpreted as evidence for the non- 
existence of an influence of the morpheme boundary on the realization of 
morphonotactic consonant clusters. Within each word pair, the stimuli not 
only differed in being monomorphemic or bimorphemic, but also in the 
grammatical category to which the target words belong. 
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3.2. WORD-MEDIAL CLUSTERS IN SAG, SGG AND FR 


In a further step, the (mor)phonotactic clusters in word-medial po- 
sition were investigated (see Leykum & Moosmiiller (2019) for word- 
medial clusters in SAG; Leykum & Moosmiiller (2017) for a comparison 
of the three languages/varieties). Here, in most cases, the grammatical 
category was identical for both stimuli within each word pair. The target 
words are listed in Table 4. 


Table 4. Target words with word-medial consonant cluster (p = phonotactic, m = mor- 
phonotactic; the word pairs not matched for grammatical category are shaded) 


SAG word-medial 


m 


Cluster 

/st/ Paste passte 
‘paste’ ‘(it) fitted’ 

küsste 
‘(s/he) 
kissed’ 

/xt/ Frachter 
‘cargo 
ship’ 

/gkt/ Akupunk- | Funkturm 
tur ‘radio 
‘acupunc- | tower’ 
ture’ 

/sm/ kosmetisch | verhalt- kosmetisch | verhält- cosmétique | transmis- 
‘cosmetic’ | nismäfig | ‘cosmetic’ | nismäfpig | ‘cosmetic’ | sion 

‘relative’ ‘relative’ “transmis- 
sion’ 
Organis- | Missmut Organis- | Missmut 
mus ‘displea- | mus ‘displea- 
‘organism’ | sure’ ‘organism’ | sure’ 

/s\/ islandisch | löslich isländisch | löslich islandaise | dislocation 

‘Icelandic’ | ‘soluble? | ‘Icelandic’ | ‘soluble’ ‘Icelandic’ | ‘disloca- 
tion’ 

islamisch | hduslich islamisch | hduslich 

‘Islamic’ | ‘domestic’ | ‘Islamic’ ‘domestic’ 

/sk/ Diskothek | diskon- Diskothek | diskontinu- | disco- discontinue 
‘disco- tinuierlich | ‘disco- ierlich thèque ‘discon- 
theque’ ‘discon- theque’ “discon- “disco- tinuous’ 

tinuous’ tinuous’ theque’ 
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/ksp/ Experte Experte Expartner | expert expatrier 
‘expert’ ‘expert’ ‘ex-part- expen ‘expatriate’ 
ner’ 
Experi- Fixpunkt | Experiment | Fixpunkt 
ment ‘fixed ‘experi- ‘fixed 
“experi- point’ ment’ point’ 
ment’ 


The target words were realized twice by all 30 speakers (16 SAG, 8 
SGG, 6 FR) within the carrier phrases. In addition, the speakers of SAG 
and FR conducted the semi-spontaneous speaking task. 


Results 
Absolute cluster duration 


The statistical analyses (with relative syllable duration as the control 
variable) showed a significant interaction between language/variety and 
articulation rate (F(3,753)=9.863, p<0.001), an interaction between ar- 
ticulation rate and cluster (F(6,918) = 11.944, p < 0.001), and a main 
effect of the speaking task (41912) = -3.018, p = 0.003; shorter clus- 
ters in the semi-spontaneous speaking task). Concerning the language/ 
variety“ articulation-rate interaction, a decrease in the cluster duration with 
increasing articulation rate was slightly steeper for the speakers of SGG 
compared to the other groups of speakers. The articulation-rate*cluster 
interaction emerged because the duration of the clusters /nkt/ and /ksp/ 
was more affected by the articulation rate than the other clusters. The 
duration of the cluster /sk/ was least influenced by the articulation rate. A 
morpheme boundary within the clusters had no influence on the duration 
of the clusters (p = 0.864). 

Since the material is not well balanced, another mixed-effects model 
was fitted for a subset of the data. Here, only the stimuli embedded in 
the carrier phrases were analysed. Furthermore, the French items and the 
word pairs Paste-passte ‘paste-fitted’, Ktiste-ktisste ‘coast-kissed’ and 
Diskothek-diskontinuierlich ‘discotheque-discontinuous’ were excluded 
so that only word pairs matched for their grammatical category were 
used, to enhance the comparability. For this subset of data, a three-way 
interaction between gender, type of cluster and grammatical category 
emerged (F(1,463) = 7.398, p = 0.007). Post-hoc analyses showed lon- 
ger durations of phonotactic clusters for both genders for adjectives. 
Concerning nouns, no effect occurred for female speakers. For male 
speakers, however, the phonotactic clusters were shorter compared to 
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the clusters of female speakers and compared to male speakers produc- 
ing morphonotactic clusters (see Figure 5). An articulation-rate*cluster 
interaction (F(4,470) = 5.970, p < 0.001) revealed the same effects as 
the analyses of the entire dataset (see above). In addition, an effect of 
variety/age (F(2,12) = 21.320, p < 0.001) revealed that speakers of SGG 
produced the clusters with longer durations compared to both age groups 
of speakers of SAG. 


Cluster duration 
Interaction gender*type-of-cluster*grammatical category 


220 


2 
= Gender:type of cluster 


female:morphonotactic 
= female:phonotactic 
189 = male:morphonotactic 


| | male:phonotactic 


fit (cluster duration - in ms) 


160 


140 


adjective noun 
Grammatical category 


Figure 5. Interaction gender*type-of-cluster* grammatical category 


Relative cluster duration (in % of mean syllable duration) 


When normalizing the cluster duration by using the mean syllable du- 
ration, the fitted mixed-effects model showed the following significant 
effects: a task*articulation-rate interaction (F(1,878) = 9.315, p = 0.002), 
a main effect of language/variety (F(3,24) = 7.962, p < 0.001), and a main 
effect of cluster (F(6,21) = 3.210, p = 0.021). A morpheme boundary 
within the cluster had no effect on the relative duration of word-medial 
clusters (p = 0.461). 
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Duration of the cluster-final consonant relative to the cluster duration 


When dividing the clusters at the position of the morpheme bound- 
ary of the morphonotactic clusters (/xt/ — /x+t/, /nkt/ — /nk+t/, /sm/ —> 
/st+m/, /sl/ — /s+1/, /sk/ — /s+k/, /ksp/ — /ks+p/) and dividing the dura- 
tion of the second part of the cluster by the total cluster duration, the rela- 
tive duration of the cluster-final consonant is calculated. The statistical 
analyses revealed that the relative duration of the cluster-final consonant 
is influenced by an interaction between articulation rate, gender and task 
(F(1,868) = 4.014, p = 0.045): for target words in carrier phrases, the 
relative duration of the cluster-final consonant is influenced by the articu- 
lation rate only for female speakers. In the semi-spontaneous speaking 
task, the articulation rate does not influence the duration of the cluster- 
final consonant. In addition, a main effect of cluster (F(6,20) = 7.800, p < 
0.001) occurred. The type of cluster had no influence on the relative dura- 
tion of the cluster-final consonant (p = 0.307). When reducing the data to 
a subset of the stimuli which were balanced in terms of the grammatical 
category, a tendency for an effect of the grammatical category (F(2,19) = 
4.723, p = 0.059) showed longer durations for the cluster-final consonant 
in adjectives compared to nouns. However, this is not a global effect of 
differences between nouns and adjectives, but more likely an effect aris- 
ing due to differences between the different word pairs. 


Absolute intensity of the cluster 


The absolute intensity of the investigated word-medial consonant clus- 
ters is influenced by a main effect of gender (t(22) = 2.929, p = 0.008), 
with higher intensity of the clusters realized by male speakers. In addi- 
tion, a main effect of cluster (F(6,22) = 20.900, p < 0.001) occurred, and 
a main effect of speaking task (t(925) = 3.890, p < 0.001), with higher 
intensities in the semi-spontaneous speaking task. The type of cluster 
had no significant influence on the absolute intensity of the clusters (p = 
0.125). 


Relative intensity of the cluster (relative to the intensity of the vowel 
preceding the cluster) 


When normalizing the cluster intensity by calculating the intensity 
in relation to the intensity of the vowel preceding the cluster, besides 
an effect of cluster (F(6,19) = 9.128, p < 0.001), a significant three-way 
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interaction between word-frequency*articulation-rate*language/variety 
(F(3,920) = 5.794, p < 0.001) occurred. In this, for the speakers of SGG, 
the relative intensity of the clusters decreased with increasing word fre- 
quency for higher articulation rates. The other groups of speakers did not 
show such an effect. Here again, the type of cluster did not influence the 
intensity values of the clusters (p = 0.133). 


Intensity of the cluster-final consonant relative to the cluster intensity 


The relative intensity of the cluster-final consonant was calculated by 
dividing the intensity of the consonant by the intensity of the cluster. The 
fitted mixed-effects model resulted in a main effect of language/variety 
(F(3,24) = 5.039, p = 0.008), a main effect of cluster (F'(6,21) = 87.051, p 
< 0.001), and a main effect of task (#(938) = 3.727, p < 0.001). Post-hoc 
analyses showed that the relative intensity of the cluster-final consonant 
is lower in FR compared to SAG and SGG. In addition, it is higher in the 
semi-spontaneous speaking task. The effect of language/variety emerged 
due to language- and item-specific word-stress differences. The type of 
cluster had no influence on the relative intensity of the cluster-final con- 
sonant (p = 0.118). 


Discussion of word-medial consonant clusters 


One advantage of investigating word-medial clusters is the possibil- 
ity to compare word pairs of the same grammatical category, as well as 
to compare French and German consonant clusters, as some consonant 
clusters exist in both languages, both within morphemes and across mor- 
pheme boundaries in a word-medial position. However, there are also 
several disadvantages: it is not possible to control the phonological con- 
text as much as for clusters in homophones; the target words are more 
diverse, not only in terms of the phonemes surrounding the consonant 
clusters, but also in terms of the exact position of the consonant clusters 
within the words, and, most importantly, in terms of the position of the 
word stress within the German target words. 

In order to investigate the impact of the grammatical category of the 
target words on the realization of the consonant clusters, the dataset was 
restricted to a subset including only word pairs where both items within 
each pair belong to an identical grammatical category. These analyses 
revealed an effect from the grammatical category in only two of the fitted 
models: concerning the absolute cluster duration and concerning the rela- 
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tive duration of the cluster-final consonant. With regard to the absolute 
duration of the cluster, the grammatical category was part of a three-way 
interaction, revealing a longer duration of phonotactic clusters for adjec- 
tives for all speakers and a shorter duration of the phonotactic clusters for 
nouns when realized by male speakers. Furthermore, the relative duration 
of the cluster-final consonant was longer in adjectives compared to nouns. 
Since there were no interactions between the grammatical category and 
the type of cluster besides the three-way interaction, it could be conclud- 
ed that for adjectives and nouns, a possible influence of the grammatical 
category could be ruled out as a factor which could mask effects of a mor- 
pheme boundary on the realization of word-medial consonant clusters. 
In addition, concerning the three-way interaction affecting the absolute 
cluster duration, the effect on the adjectives is in the opposite direction 
to the hypothesis: in adjectives, phonotactic clusters were longer than 
morphonotactic clusters. Concerning nouns, the phonotactic clusters of 
male speakers were shorter compared to the other clusters, which could 
be interpreted as less reduction of the morphonotactic clusters by male 
speakers, compared to a low level of reductions by female speakers, ir- 
respective of the presence or absence of a morpheme boundary. 

Concerning all other investigated variables, no effect of a consonant 
cluster internal morpheme boundary on the realization of the word-medi- 
al clusters was detectable. 


4. GENERAL DISCUSSION 


Previous studies investigating the influence of morpheme boundaries 
on speech production came to different results. Some studies revealed an 
effect indicating an acoustic highlighting of the morpheme boundary by 
lengthening phonemes across morpheme boundaries. Other studies found 
no effect of a morpheme boundary, or even indicated results with an ef- 
fect in the opposite direction. Likewise, in the present study, for some 
variables an effect of the type of cluster emerged, either in the expected 
direction or in the opposite direction. However, most analyses did not find 
any effect of the morpheme boundary. 

Since the effects of an influence by the morpheme boundary can all 
be easily explained by other interfering variables, the present study is not 
able to give any evidence for an impact of morphology on speech pro- 
duction. However, the absence of any effects does not necessarily imply 
that no influence of morpheme boundaries on the realization of consonant 
clusters exists. 
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The possibility that the highly redundant coding of the information of 
the morpheme boundary in conjugated verbs with word-final morphono- 
tactic consonant clusters leads to a less accurate articulation cannot be 
ruled out. Opposite effects caused by the morpheme boundary and the 
high redundancy are still possible. Due to the unnatural wording and the 
lack of a possibility to match the target words in terms of the position 
within the sentences, the additional reading task did not provide conclu- 
sive findings. 

Another factor linked to the redundancy, described by Hanique and 
Ernestus (2012: 175), is the word-information load: “The less a segment 
contributes to distinguishing the complete word from other words, the 
more it may be reduced”. Equally, the degree of morphological decom- 
posability could constitute a factor influencing whether morphonotactic 
consonant clusters are treated differently from phonotactic clusters in 
speech production. 

The present findings were able to rule out language- and/or variety- 
specific timing characteristics as a factor inhibiting an acoustic differ- 
entiation between phonotactic and morphonotactic clusters (see also 
Leykum & Moosmüller 2017). However, besides language-specific tim- 
ing characteristics, other language-specific differences could exist. The 
investigated languages share a low morphological richness, raising the 
question of whether the morphological richness of a language determines 
whether phonotactic and morphonotactic clusters behave the same or not. 
It is possible that in morphologically richer languages, the information 
about the morpheme boundary is more important to ensure intelligibility. 
A fact supporting this hypothesis is research on first language acquisi- 
tion. It was shown that children acquiring Austrian German learn both 
types of clusters at the same time (Freiberger 2007), whereas, in the first 
language acquisition of the morphologically richer languages Polish and 
Lithuanian, children learn to produce morphonotactic consonant clusters 
correctly prior to phonotactic consonant clusters (Kamandulytė 2006; Zy- 
dorowicz 2007). 


5. CONCLUSION 


Combining the present findings with analyses of the subsegmental 
parts of /t/ in word-final clusters (Leykum & Moosmüller 2015; Leykum 
et al. 2015b; Leykum & Moosmüller 2018), conducted on the same ma- 
terial, none of the analyses could prove that morphonotactic consonant 
clusters are more highlighted or less susceptible to reduction processes. 
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Yet, the present analyses do not prove that phonotactic and morphonotac- 
tic consonant clusters are identical in their phonetic realization, since sta- 
tistically insignificant results do not imply that no effect exists. However, 
the fact that quite a lot of analyses were conducted on a relatively large 
dataset, all showing no or no stable effect of the morpheme boundary on 
speech production, leads us to the conclusion that it is very unlikely that 
speakers realize morphonotactic consonant clusters in German differently 
because of the morpheme boundary. 
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III. The acquisition and processing of 
(mor)phonotactic consonant clusters in German 


SABINE SOMMER-LOLEI!” 
KATHARINA KORECKY-KROLL? 
MARKUS CHRISTINER?*4 
WOLFGANG U. DRESSLER! 


1. INTRODUCTION 


The aims of this psycholinguistic contribution are to show how mor- 
phonotactic and phonotactic German consonant clusters differ 1) in ear- 
ly spontaneous first language acquisition and 2) (more importantly) in 
processing experiments, under which conditions one of the two cluster 
types is acquired earlier and processed more accurately and with shorter 
latency, as well as what the impact of frequency, familiarity and foreign- 
ness on the processing of simple words, compounds and morphological 
derivatives is. 

Phonotactics and morphotactics interact in the area of morphonotac- 
tics (Dressler & Dziubalska-Kołaczyk 2006). As will be shown in the 
following sections of this chapter, consonant clusters with and without 
morpheme boundaries are a good testing ground for the investigation of 
morphonotactics. Whereas phonotactic consonant clusters are found in 
simple word stems (e.g. German Wicht ‘wight’ ), morphonotactic clusters 
cross morpheme boundaries in inflected, derived or compound words 
(e.g. German (er/sie/es) mach-t ‘(s/he/it) makes’, Reich-tum ‘richness’, 
Pech+tag ‘off-day’). Sometimes morphonotactic clusters are entirely 
new consonant clusters that can only be found in morphologically com- 
plex words (e.g. German ruf-st ‘(you) call’), but sometimes they may 
also be homophonous with existing phonotactic clusters (e.g. German 
morphonotactic /ob-st ‘(you) praise’ vs. phonotactic Obst ‘fruit’). The 
question that arises is whether the interaction between phonotactics and 
morphonotactics facilitates the processing and acquisition of morpho- 
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notactic clusters or makes it more difficult, or if it makes no difference 
whether a word contains a phonotactic or morphonotactic cluster. In 
contrast to Slavic languages, German has only very few examples of 
morphonotactic clusters which arise through vowel deletion, and all 
clusters involved are homophonous with phonotactic clusters, e.g. in 
the adjective risk-ant ‘risky’ derived from the noun Risiko ‘risk’. For 
a corpus linguistic description of complex German consonant clusters, 
see Dressler and Kononenko in this volume. 

Previous research (Zydorowicz 2007, 2009, 2010; Kamandulyte 
2006; Freiberger 2007, 2014; Korecky-Kroll & Dressler 2015; Zydoro- 
wicz et al. 2015) has shown no greater acquisitional difficulties for mor- 
phonotactic clusters compared to phonotactic clusters in typically devel- 
oping children. Evidence for the facilitation of acquiring morphonotactic 
clusters has been found for Polish and Lithuanian, but not for German. 
Evidence for the ease of processing of German has been divergent (Ko- 
recky-Kroll et al. 2014; Celata et al. 2015; Freiberger et al. 2015). 

The results of the above investigations have led to the hypothesis 
(Zydorowicz et al. 2015) that processing ease correlates with the mor- 
phological richness of the respective language. Morphological richness 1s 
defined as the amount of productive morphology (Dressler 1999, 2004), 
and it has been demonstrated by Xanthos et al. (2011) that its presence 
in child-directed speech supports children in developing and speeding up 
the acquisition of morphology. For processing, our corresponding claim 
refers to Libben’s (2014) principle of maximum opportunity. 

German morphology is relatively poor in inflection, which probably 
explains why we have found no facilitation of the acquisition and pro- 
cessing of morphonotactic clusters, but it is rich in compounding and 
several areas of derivational morphology. 

We examined the influence of (mor)phonotactics in visual word rec- 
ognition in lexical access and word identification (Korecky-Krdll et al. 
2014; Freiberger et al. 2015; Korecky-Kroll et al. 2016; Sommer-Lolei, 
Korecky-Kroll & Dressler 2017) in order to test the Strong Morphono- 
tactic Hypothesis (Dressler & Dziubalska-Kolaczyk 2006), which claims 
that morphonotactic consonant clusters facilitate processing and acquisi- 
tion and lead to higher accuracy because of the significant morphological 
information those clusters carry. We will test this hypothesis for German, 
to see whether it will be supported because morphonotactics facilitates 
word recognition in lexical processing. Or will processing rather be im- 
peded due to the higher processing cost of inflected word forms vs. base 
forms? Furthermore, we hypothesize first that the Strong Morphonotactic 
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Hypothesis may not be equally true for all languages and may depend on 
the morphological richness of a language or its respective morphological 
subsystem; second, that factors other than the often-overestimated fre- 
quency may play a major role in processing. 


2. CONSONANT CLUSTERS IN FIRST LANGUAGE ACQUISITION 


Freiberger (2014) investigated the early acquisition of morphonotactic 
and phonotactic consonant clusters in three typically developing mono- 
lingual toddlers (1 boy, 2 girls) from high SES backgrounds (HSES), who 
were acquiring Standard Austrian German. They were recorded longi- 
tudinally at their homes in Vienna, from 1 year 7 months (1;7) up to 3;0 
years. She divided this period into three’? developmental phases (phase 
1: 1;7-2;0, phase 2: 2;1—2;6, phase 3: 2;7-3;0). All of these data were 
transcribed and coded by using an adapted German version of CHILDES 
(cf. MacWhinney 2000). Freiberger analysed 180 minutes per child and 
phase and investigated all correctly and incorrectly produced consonant 
clusters in the spontaneous speech of these mother-child interactions in 
word-initial, -medial and -final position. 

The results show, as expected, that all children make significant 
progress from the first to the third phase, and that the children have 
more difficulty with word-initial clusters, which are all phonotactic in 
German, than with word-final clusters. This can be attributed to the fact 
that final elements are perceived best, well known as the recency effect 
(cf. Eysenck & Keane 2000). In this study no significant morpheme 
boundary effect was found, which reveals that there is no difference 
between morphonotactic and phonotactic consonant clusters in the early 
acquisition of German. 

In another investigation by Korecky-Kröll et al. (2016) the spontane- 
ous speech of parent-child interactions of 29 typically developing mono- 
lingual and Standard Austrian German-acquiring children from different 
SES backgrounds (HSES vs. lower SES (LSES)) (7 boys and 8 girls of 
HSES, 8 boys and 6 girls of LSES) of the INPUT project’ was analysed. 
The children were video and audio recorded in everyday situations with 
their main caretakers in their homes in Vienna at four data points at the 


> Except for the boy, whose audio recordings already started at age 1;3, and therefore 
have an additional fourth phase (phase 0: 1;3—1;6), cf. (Freiberger 2014: 7). 

° INPUT project: ‘Investigating Parental and Other Caretakers’ Utterances to Kinder- 
garten Children’ (SSH11-027) funded by the Wiener Wissenschafts-, Forschungs- 
und Technologiefonds (WWTF Vienna Science and Technology Fund). 


80 S. Sommer-Lolei, K. Korecky-Kréll, M. Christiner, W. U. Dressler 


mean ages 3;1, 3;4, 4;4 and 4;8. The most interactive 30 minutes per re- 
cording were transcribed, coded and analysed, which is equivalent to 58 
hours of spontaneous speech material. The correct use of Cst (consonant 
+st) clusters, in a medial and final position was examined, in correlation 
to the effects of the data point, SES and morpheme boundary. 

The results demonstrated that the children made significant progress 
from the third to the fourth data point and also showed a significant 
effect of the socio-economic status, since LSES children showed lower 
accuracy. Similarly to Freiberger’s investigation (see above), there was 
no significant influence from the morpheme boundary. 

The overall results show that there are no differences in first lan- 
guage acquisition of German between morphonotactic and phonotactic 
consonant clusters, neither in very young (Freiberger 2014) nor in older 
children (Korecky-Kroll et al. 2016). This is similar to the findings of 
Kirk and Demuth (2005) for English, which is a still more weakly in- 
flecting language than German, and unlike the results presented in stud- 
ies of strong inflecting, morphologically richer languages, such as Polish 
and Lithuanian (cf. Zydorowicz 2007, 2009, 2010; Kamandulyté 2006; 
Zydorowicz et al. 2015), in which the authors showed some differences 
in favour of morphonotactic clusters. This difference might be due to 
the fact that most of the investigated German morphonotactic clusters 
occurred in inflected word forms. Still, Freiberger (2007: 20) demon- 
strated that the presence of a morpheme boundary does not render word 
recognition and production in child speech (CS) more difficult. 

There are two open issues which we would like to address. First, wheth- 
er or not the preference for morphonotactic clusters in certain languages 
or their morphological subsystems depends on the degree of morphologi- 
cal richness; and second, whether the acquisition of distinct consonant 
patterns truly correlates with the presence of a morpheme boundary, or 
whether, instead, the morpheme boundary itself is the important factor, 
regardless of the existence of a consonant cluster. 


3. CONSONANT CLUSTERS IN PROCESSING 
3.1. PREVIOUS EXPERIMENTS 


To test the Strong Morphonotactic Hypothesis for German in adult 
and adolescent processing, four experiments were performed, which 
led to divergent results. A study by Korecky-Kréll et al. (2014) used a 
visual sequence targeting experiment in which 84 native Standard Aus- 


II. The acquisition and processing of (mor)phonotactic consonant clusters 81 


trian German-speaking participants had to detect whether a given stimu- 
lus contained either one letter (T) or a sequence of two letters (ST, AN). 
The experiment was divided into four different tasks: find T (in words 
containing ST); find T (in a word containing only T, but not ST); find 
ST; and find AN. In the first two parts participants had to find T a) in a 
consonant+tst (Cst) sequence (e.g. Obst ‘fruit’ vs. lob-st ‘you praise’) and 
b) in a consonant+t (Ct) combination (e.g. Karton ‘cardboard’ vs. dank- 
te ‘thanked’, Lift ‘elevator’ vs. pack-t ‘(s/he/it) packs’). In the third task, 
participants had to detect ST in a word presented visually (e.g. Stempel 
‘stamp’ vs. brav-ste (good-suP) ‘best’ superlative); and in the fourth, they 
had to find AN (e.g. Fasan ‘pheasant’ vs. vor-an ‘ahead’). Half of the 
stimuli did not contain the respective sequence, whereas the other half 
was divided into subgroups that contained the sequence in different po- 
sitions. For example, in the find T (in T) experiment, 96 stimuli did not 
contain a /t/, whereas 16 stimuli contained a /t/ in the initial position, 16 
in a medial position without morpheme boundary, 16 in a medial posi- 
tion with morpheme boundary, 16 in the final position without morpheme 
boundary, 16 in the final position with morpheme boundary as a default 
and 16 in the final position with obligatory morpheme boundary (after a 
diphthong or long vowel). 

Besides other factors, the authors reported a significant facilitating im- 
pact from the morpheme boundary only in terms of reaction times (RT), 
but not of accuracy (ACC), which indicates processing on a sublexical 
level. 

Celata et al. (2015) examined the processing of morphonotactic and 
phonotactic clusters in German in two different experiments. First, they 
performed a split cluster task, in which 38 adults (29 years and older) and 
26 adolescents (11 to 15 years of age), all native speakers of Standard 
Austrian German, had to create novel diminutives and attenuative forms 
by inserting the vowel /i/ between two consonants (CC—>CiC). They were 
presented with 14 monosyllabic test items ending in a Cst-cluster, half 
containing a morphonotactic cluster, half a phonotactic one, and 56 filler 
words with differing clusters. Thus, they had to transform, for example, a 
Cst-cluster /nst/ into /nist/ or /nsit/ in phonotactic Dunst ‘mist’ (— *Du- 
nist or *Dunsit) and in morphonotactic kenn-st ‘you know’ (— *kennist 
or *kennsit). The result showed an overall preference for /ist/ over /sit/ 
responses and, only within the adult group, high accuracies regardless of 
the type of cluster. For the group of adolescents, the morphonotactic clus- 
ters were significantly easier to split than the phonotactic ones, whereas 
the adult group only showed a trend in favour of the stimuli containing a 
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morpheme boundary. This demonstrates that the presence of a morpheme 
boundary tends to be helpful in connection with a word modifying task. 

Second, a fragment monitoring task was conducted with 28 adoles- 
cents (aged 12 to 16) and 41 adults (aged 25 to 59), again all native speak- 
ers of Standard Austrian German. Participants were visually presented 
with a string in capital letters in the centre of the screen, while hear- 
ing words over headphones. They had to decide as quickly as possible 
whether the auditorily perceived word contained the string presented on 
the screen. In this experiment 30 German words, with a Cst-cluster in 
the mid- or final position, and 75 filler words, containing other clusters, 
served as stimuli (e.g. the Cst-cluster /nst/ in phonotactic Kunst ‘art’ vs. 
morphonotactic kenn-st ‘you know’). 

The results showed that, overall, adults were significantly more ac- 
curate compared to the group of adolescents, which shows again that the 
acquisition of phonology and morphonology is not completely finished 
in adolescence. Despite our expectations, the presence of a morpheme 
boundary had no effect on adult accuracy, although it did have an ef- 
fect in the latency of the younger group, with phonotactic patterns be- 
ing detected faster than morphonotactic ones. Also, the adolescents made 
more errors in morphonotactic items. The authors point to the significant 
impact of frequency in this case, since high-frequency items were more 
often not only judged correctly, but also faster than low-frequency ones in 
the adult group, whereas this frequency difference did not show as much 
of an effect in adolescents. This points to different processing strategies 
in such a recognition task across ages. 

Since the above experiments focused only on the sublexical level, as 
either morphemes, parts of morphemes or clusters that contained mor- 
phemes were processed, there was a strong need to investigate adult lan- 
guage processing on a higher level of language awareness, namely the 
lexical level. 


3.2. EXPERIMENTS ON LEXICAL ACCESS 


In order to investigate the influence of (mor)phonotactics in lexical 
processing and to see whether morphonotactics facilitates or impedes 
visual word recognition, various experiments were performed within our 
research project. 

The findings of all new experiments are first summarized and only 
afterwards are they accurately described. To investigate the processing of 
morphonotactic and phonotactic consonant clusters in whole word rec- 
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ognition, we conducted a progressive demasking task (PDT) and four 
different lexical decision tasks (LDT): 


1. Inflection vs. simplex words (LDT 1) 
2. Progressive demasking task on inflection vs. 
simplex words (PDT) 
3. Compounds vs. simplex nouns (LDT 2) 
4. Derivations vs. simplex nouns (LDT 3) 
5. Compounds and derivations vs. simplex nouns (LDT 4) 


Previous studies on whole-word processing have demonstrated a 
higher processing cost for inflected word forms as opposed to monomor- 
phemic words (e.g. Finnish, cf. Laine et al. 1999) or for inflected base 
forms (e.g. German, cf. Giinther 1988). Therefore, the first experiments 
(LDT 1 and PDT) were created by Freiberger et al. (2015) to test whether 
native speakers of German show similar tendencies and whether they 
are sensitive to the presence of a morpheme boundary within a conso- 
nant cluster (e.g. /gt/, /bl/, /mt/) when the item is an inflectional form, or 
whether the morpheme boundary would not delay processing. 

Both experiments (LDT 1 and PDT) contained inflected word forms 
which are non-citation forms. This suggested that it might be problematic 
to compare a citation form to a non-citation form. Therefore, Korecky- 
Kröll et al. (2016) performed another lexical decision task (LDT 2) in 
order to clarify this issue. Instead of comparing the processing of inflec- 
tional forms to monomorphemic words, compounds, which are citation 
forms like monomorphemic words, were used. Compounding is also a 
morphologically rich domain of German, richer than inflection, which 
is another issue that needed to be addressed. We hypothesized that com- 
pounds would be processed not only more accurately but also much faster 
than monomorphemic words. 

In order to cover all categories of word formation, Sommer-Lolei et 
al. (2017) also conducted an experiment with derived words compared to 
monomorphemic words (LDT 3). Derivations, like compounds, are cita- 
tion forms, but are expected to be harder to process, because compounds 
are morphosemantically more transparent than comparable derivations. 

In our final lexical decision task 4 we combined the two previous 
experiments (LDT 2 and 3) into one, considering two newly introduced 
variables, namely familiarity and foreignness (see also 3.7). 
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3.3. METHODOLOGY 


To investigate lexical access, four lexical decision tasks were per- 
formed. The experiments (LDT 1, 2 and 3) were designed using the be- 
havioural research software E-Prime 3.0 (Psychology Software Tools, 
Pittsburgh, PA) and were all carried out on the same Windows laptop 
(Freiberger et al. 2015; Korecky-Kröll et al. 2016; Sommer-Lolei et al. 
2017). LDT 4 was an online experiment, programmed and provided by 
URL: https://quest.christiner.at/. Due to the high number of participants 
that were all tested in one day, we tested on several Windows computers 
simultaneously. All of the test items were presented to the participants 
visually, capitalized in the centre of the screen, following a fixation cross. 
Participants had to decide as accurately and quickly as possible whether 
the presented string was an existing German word or not. The relevant 
keys were marked with a green sticker for an affirmative response, and a 
red sticker for a negative response. All reaction times measured in LDT 1, 
2 and 3 that were below 300 ms or exceeded 2500 ms were excluded from 
the analysis. For LDT 4 reaction times were measured but considered as 
not reliable due to the high number of different computers used at the 
same time and intermittent connectivity issues. 

The progressive demasking task (PDT) was used to test word identi- 
fication. The experiment was designed using the PDT software (Dufau, 
Stevens & Grainger 2008). The cut-off values in this experiment were 
< 300 ms and > 3500 ms. Participants had to identify on their computer 
screen a slowly demasking stimulus and to confirm identification of it as 
soon as possible by pressing a key. As a result, the word disappeared from 
the screen and they had to type the previously identified word as quickly 
as possible. The same Windows laptop was used to conduct the experi- 
ment as in LDT 1, 2 and 3. 

The choice of stimuli for LDT 4 was established as a consequence of 
two ratings that revealed the degree of familiarity and foreignness of the 
test items previously used in LDT 2 and 3. In the first rating of compounds 
vs. monomorphemic words, participants got a list of 96 items and had to 
decide whether or not the word on the list was a foreign word, and they 
had to judge spontaneously how familiar they are with this word. While 
the foreignness rating was a decision task (Yes or No), the familiarity rat- 
ing had a range on a scale from 1 (well known) up to 5 (unknown). This 
was conducted using an interactive PDF (Adobe Acrobat DC). Each rating 
was either forwarded immediately to the investigator via email or all rat- 
ings filled in by hand on a printed form were handed in by the participants. 
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The second rating of derivations vs. monomorphemic words used an on- 
line questionnaire, programmed and provided by Markus Christiner on 
an online platform, URL: https://quest.christiner.at/. Similarly to the first 
rating, participants had to decide on the foreignness and familiarity of 
96 stimulus words that were presented one after the other, whenever the 
participant clicked on the Weiter ‘next/go on’ button. It was not possible 
to measure reaction times. 


3.4. MATERIALS 


In each of the tasks the experimental items were 96 German words and 
96 German-based non-words (one letter of a German word was changed 
in the monomorphemic words, two letters were changed in the compound 
and derivation-based items, i.e. one consonant or vowel in each of the two 
morphological parts), with the exception of LDT 4, in which the stimuli 
were 96 German words and 32 German-based non-words, divided into 
four conditions, as presented in Table 1: 


Table 1. Conditions of the test stimuli 


Condition Explanation 
l M+P+ | containing a consonant cluster that crosses the morpheme boundary 


M+P- | containing a morpheme boundary, but no consonant cluster 


2 
3 | M-P+ | monomorphemic word containing a consonant cluster 
4 


M-P- monomorphemic word without a consonant cluster 


This results in 24 words and 24 non-words per condition. Half of 
the items contained a morpheme boundary, while the other half did not, 
which is exactly the same as the items containing or not containing a con- 
sonant cluster. In LDT 1 and the PDT (Freiberger et al. 2015), stimulus 
words contained only biconsonantal clusters (e.g. /gr/, /br/), whereas in 
LDT 2 (Korecky-Kröll et al. 2016), LDT 3 (Sommer-Lolei et al. 2017) 
and LDT 4, bi- and triconsonantal clusters occurred (e.g. /rtn/, /Ist/). The 
position of the consonant clusters was either in the final (LDT 1) or in a 
medial position (LDT 2, 3 and 4). For every morphonotactic consonant 
cluster there existed a phonotactic match (e.g. LDT 2 on compounds: 
(M+P+) Haus+tier ‘domestic animal’ vs. (M—P+) Kastanie ‘chestnut’ ). 
Therefore, we also matched conditions 2 and 4 (e.g. LDT 3 on deriva- 
tions: (M+P-—) Zeig-er ‘pointer’ vs. (M—P—) Lager ‘storage’). Examples 
of stimuli for each experiment are listed in Table 2; for a summarizing 
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overview of the tasks, see Tables 3 and 4. All of the items were balanced 
for word length, syllables and average word frequency (taken as the num- 
ber of occurrences from the CELEX, the Austrian Media Corpus (AMC?) 
and also the Leipzig Deutscher Wortschatz Online databases). 


Table 2. Examples for German words used in the experiments per condition 


Task | LDT 1 & PDT LDT 2 LDT 3 
(Inflection) (Compounding) (Derivation) 


LDT 4 (Compounding and Derivation) 


Cond. 1 (M+P+) wag-te ‘dared’ Haus+tier ‘domes- | Eitel-keit ‘vanity’ 
tic animal’ 


triconsonantal Hals+tuch ‘scarf’ Gärt-ner ‘gardener’ 
clusters 


Cond. 2 (M+P-) heb-en ‘lift’ Tee+tasse ‘teacup’ | Zeig-er ‘pointer’ 


Cond. 3 (M—P+) Zimt ‘cinnamon’ Kastanie ‘chestnut’ | Balsam ‘balm’ 


triconsonantal - Holster ‘holster’ Partner ‘partner’ 
clusters 


Cond. 4 (M-P-) 


Mücke ‘mosquito’ | Rakete ‘rocket’ Lager ‘storage’ 


3.5. PARTICIPANTS 


All the participants were adult monolingual native speakers of Stan- 
dard Austrian German. None of them reported visual or neurological im- 
pairments or a history of language disorders. 

In LDT 1, in which processing of inflectional forms was compared to 
monomorphemic word processing, 46 adults (aged 19 to 35) participated. 

In the progressive demasking task, in which the same inflectional 
forms and monomorphemic words taken from the previously conducted 
LDT 1 had to be identified, 45 adults (aged 19 to 31) participated, like- 
wise in LDT 2, in which the processing of monomorphemic words was 
compared to the processing of compounds and in LDT 3, where the pro- 
cessing of derivations compared to monomorphemic words was investi- 
gated. 

The first rating of foreignness and familiarity with regard to the list of 
compounds and monomorphemic words was conducted with 130 adult 
participants (aged 18 to 51); the second rating of the stimulus word list 


7 The Austrian Media Corpus of the Austrian Academy of Sciences, based on APA 
(Austrian Press Agency) data, consists of over 10 billion word tokens. URL: https:// 
www.oeaw.ac.at/acdh/tools/amc-austria-media-corpus/ [20.03.2019]. 
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of derivations and monomorphemic words was performed by 102 adults 
(aged 20 to 59). Both ratings were used for stimulus word selection in 
LDT 4. 

In LDT 4, as our most recently performed experiment, in which 84 
adults (aged 18 to 36) participated, we united the previous two experi- 
ments LDT 2 and 3 in order to investigate the processing of compounds 
and derivations compared to monomorphemic words, in consideration of 
the familiarity and foreignness ratings of the stimulus words, where we 
selected in equal parts very familiar, familiar, very unfamiliar, very for- 
eign, foreign and non-foreign stimulus words (compounds, derivations 
and monomorphemic nouns). 

For a summary of all the performed tasks, ratings and participants, see 
Tables 3, 4 and 5: 


Table 3. Overview of the visual word recognition tasks on inflection 


co e 


Computer Win. Laptop Win. Laptop 
Cut off values ms < 300 > 2500 < 300 > 3500 
PNo.simutus non-words [56 Js | 


Table 4. Overview of the lexical decision tasks on compounding and derivation 


ES 
Computer Various Computers 
Yes 


Yes 
Cut off values ms < 300 > 2500 < 300 > 2500 RT not reliable 


No. stimulus words |96 l 


RTs measured Yes 
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Table 5. Overview of the familiarity and foreignness ratings 


Comp. and simplex words 
No. stimuli 96 C 
No. participants (age) 130 (18-51 yrs.) 
Method interactive PDF 
(Adobe Acrobat DC) 
Familiarity (scalar) 1 (well known) to 5 (unknown) 
Foreignness (nominal) Yes or No 


3.6. RESULTS OF THE PROCESSING EXPERIMENTS 


In the first whole-word recognition tasks, conducted by Freiberger 
et al. (2015), divergent results were found. Regarding the LDT 1 and 
the PDT on inflection, they found that in both experiments words with a 
morpheme boundary were significantly more difficult to process than all 
the other categories. In particular, the category M+P+, which are strings 
that contain a morpheme boundary and a consonant cluster, showed the 
highest latency, and in the lexical decision task also the lowest accuracy. 
This is in accordance with processing models which assume that affixed 
words are decomposed into base form and affix, which leads to higher 
processing costs. They did not find this accuracy result in the PDT; in- 
stead words without a morpheme boundary and with a consonant cluster 
(M-—P+) were processed more accurately. The longer reaction times for 
the M+P+ items can be explained by the fact that these strings were 
all inflectional forms. Therefore, Freiberger and colleagues concluded 
that it is problematic to compare a non-citation form with a citation 
form, as already mentioned above, but also, this may be due to the fact 
that German is only a weakly inflecting language, in which inflectional 
morphology is not important enough to facilitate lexical processing. In 
the strongly inflecting language Polish, Zydorowicz and Dziubalska- 
Kolaczyk (2017) found that the morpheme boundary helped processing. 
The results furthermore show that it is not the cluster that renders word 
recognition more difficult, but rather the morpheme boundary combined 
with the cluster. In this case the Strong Morphonotactic Hypothesis 
could not be confirmed for German, although there had been some evi- 
dence for it on the sublexical level in previous experiments (see 3.1). 

In order to verify these findings and to test the hypothesis concerning 
the impact of citation forms, Korecky-Kroll et al. (2016) conducted the 
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second experiment, LDT 2, on compounding. Since inflected word forms 
were included in the first experiments (Freiberger et al. 2015, see above), 
we wanted to see whether words that are citation forms, like monomor- 
phemic words, and that derive from a morphologically rich domain of 
German, much richer than inflection, would lead the participants to a 
different behaviour. The results for accuracy showed that compounds 
with morphonotactic clusters (M+P+ items, with morpheme boundary 
and consonant cluster), but also M+P-— items (compounds without a con- 
sonant cluster) show significantly higher accuracy than either type of 
monomorphemic words. This diverges from the results of the previous 
lexical decision experiments on inflection, and therefore supports the 
Strong Morphonotactic Hypothesis. The results for latency showed no 
significant difference, but a trend in favour of M+P+ compounds (with 
morpheme boundary and with consonant cluster). 

These results have demonstrated that the decomposition of compounds 
does not slow down processing but is, rather, automatic, which supports 
Libben’s (2014) principle of maximum opportunity. 

Korecky-Kroll et al. (2016) therefore concluded that the facilitation 
process in the acquisition and processing of morphonotactic clusters only 
seems to apply in a language or linguistic domain that is morphologically 
rich, and it was suggested that the Strong Morphonotactic Hypothesis 
be modified accordingly (also in Sommer-Lolei et al. 2017). Thus, the 
interaction with morphology appears to facilitate processing where it “is 
worth it”. 

The third lexical decision experiment on derivational morphology 
(LDT 3) was performed by Sommer-Lolei et al. (2017) in order to sup- 
port our theoretical claim that the morphological richness of a certain 
area facilitates processing, 1.e. by also investigating the derivational do- 
main. 

The results of the LDT 3 demonstrated that M+P-— items in particular 
(derivations containing a morpheme boundary, but no consonant clus- 
ter), but also M+P+ items (derivations containing a morpheme boundary 
and a consonant cluster), yield a significantly higher accuracy and are 
processed significantly faster than both types of monomorphemic words. 
Derived nouns (derived via productive word-formation rules) were pro- 
cessed more accurately than simplex nouns. Unlike the previous experi- 
ment on compounding (Korecky-Kroll et al. 2016, see above), these re- 
sults do not support the Strong Morphonotactic Hypothesis directly but 
only indirectly, insofar as we found a positive effect on processing when- 
ever a morpheme boundary is present. 
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To demonstrate differences in the processing of morphonotactic and 
phonotactic consonant clusters in compounds, derivations and monomor- 
phemic words (LDT 4), a repeated measures ANOVA was performed on 
the mean values of the correct responses. The results show, with a Green- 
house-Geisser correction, that words containing a morpheme boundary 
were processed significantly differently F(2.48,195.73) = 30.76, p < 0.01, 
regardless of whether the string contained a consonant cluster or not. Con- 
dition 1 (M+P+ Mean = 0.85) was significantly different from conditions 
3 (M-P+ Mean = 0.79) and 4 (M—P-— Mean = 0.79) but not from condition 
2 (M+P— Mean = 0.84). M+P+ and M+P- items were significantly differ- 
ent from both categories without a morpheme boundary. 


3.7. RESULTS OF THE FAMILIARITY AND FOREIGNNESS RATINGS 


In our lexical decision tasks on compounds (LDT 2) and derivations 
(LDT 3) the high number of foreign words within the group of mono- 
morphemic words (e.g. Taifun ‘typhoon’, Baklava ‘baklava’) that were 
used within the experiments still seemed problematic to us as a possible 
confounding factor. Use of these items was due to the necessity of match- 
ing compounds and derivations to monomorphemic words in frequency 
and word length. 

Therefore, we had 130 German native speakers rate all our stimulus 
words for the compound experiment in terms of the degree of familiarity 
and foreignness, which revealed that the four categories of stimuli dif- 
fered significantly in their perceived degree of foreignness (see Figure 1), 
whereas in the familiarity rating only monomorphemic words without 
consonant clusters (M—P-—) were rated significantly less familiar than all 
other categories (see Figure 2): 


M-P- M-P+ M+P- M+P+ 


Figure 1. Degree of foreignness in the four categories 
(Rating of stimulus words for LDT 2) 


Il. The acquisition and processing of (mor)phonotactic consonant clusters 91 


10 15 20 25 30 35 40 


M-P- M-P+ M+P- M+P+ 


Figure 2. Degree of familiarity in the four categories 
(Rating of stimulus words for LDT 2) 


The results for compounds and monomorphemic words show that 
foreignness and, especially, unfamiliarity reduce accuracy and delay the 
speed of processing significantly. 

Although familiarity and foreignness were less likely to constitute in- 
tervening variables for derivations than for compounds, we nevertheless 
had the 96 stimulus words rated for this experiment by 102 native speak- 
ers of German, which revealed a significant difference between the cat- 
egories M— and M+ regarding the degree of foreignness (see Figure 3). In 
the familiarity rating we found no significant difference between the four 
categories (see Figure 4). 


M-P- M-P+ M+P- MP+ 


Figure 3. Degree of foreignness in the four categories 
(Rating of stimulus words for LDT 3) 
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Figure 4. Degree of familiarity in the four categories 
(Rating of stimulus words for LDT 3) 


4. STATISTICAL ANALYSES 


4.1. IMPACT OF FREQUENCY, FAMILIARITY AND FOREIGNNESS ON 
ACCURACY 


As mentioned above, we hypothesized that familiar words, ranked in 
advance by native speakers of Standard Austrian German (see also 3.4), 
would be processed significantly more accurately than unfamiliar words 
and second, that stimulus words judged as foreign would result in signifi- 
cantly lower accuracy than all of the other words. 

To demonstrate the effects of frequency, familiarity and foreignness in 
adult language processing we analysed their impact in terms of accuracy 
of words (non-words were excluded) in LDT 2 (compounds vs. simplex 
nouns), LDT 3 (derivations vs. simplex nouns) and LDT 4 (compounds 
and derivations vs. simplex nouns). 

As for LDT 2 (Korecky-Kröll et al. 2016), statistical analyses of the 
accuracy of words reveal that words with a high AMC token frequency 
are significantly more likely to be judged correctly and that high-frequen- 
cy words containing a morpheme boundary, especially M+P+ items, are 
processed significantly more accurately than words without one. In terms 
of familiarity and foreignness, we found that stimulus words that are ei- 
ther unfamiliar and/or foreign have significantly less accurate results than 
familiar and non-foreign words. In addition, there is no difference in this 
respect in the processing of morphonotactic and phonotactic consonant 
clusters across the four categories. 

In our LDT 3 on derivations compared to monomorphemic nouns 
(Sommer-Lolei et al. 2017), words containing a morpheme boundary 
have significantly more accurate results compared to monomorphemic 
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words, regardless of the absence or presence of a consonant cluster. Also, 
we found that words without a consonant cluster and without a morpheme 
boundary (M-—P-) are processed significantly less accurately than M—P-+, 
which demonstrates a processing strategy of preferring a phonotactic 
consonant cluster whenever no morpheme boundary is present. In other 
words, the presence of a morpheme boundary leads to significantly higher 
results than the presence of a cluster. This indicates the relevance of mor- 
pheme boundaries. 

Similar to our finding in LDT 2, we found, with regard to familiarity 
and foreignness, that stimulus words that are either unfamiliar or foreign 
or both have significantly less accurate results than familiar/non-foreign 
words. In the case of familiarity of items, there is no difference in the 
processing of morphonotactic and phonotactic consonant clusters. How- 
ever, when items are foreign, we found that words with a phonotactic 
consonant cluster (M—P+) are processed significantly more accurately 
compared to all other categories. 

This appears to mean that (rather) foreign words of limited length are 
expected to be monomorphemic words, and this respects the fact that 
more simplex words are loaned than affixed words, in contrast to loaned 
English compounds. 

By merging the previously conducted two experiments into one new 
experiment with respect to the degree of familiarity and of foreignness 
of the stimulus words, we found, with regard to the impact of token fre- 
quency, that words containing a morpheme boundary were processed 
with significantly greater accuracy compared to monomorphemic words, 
with M+P- items scoring highest. The processing of unfamiliar or foreign 
words in LDT 4 shows that these items are processed significantly less 
accurately, as in the previously conducted LDT 2 and 3. For familiarity 
we found no significant difference between the four categories. Foreign 
words containing a morpheme boundary have significantly less accurate 
results than words without it. 

Statistical analyses of accuracy undertaken by means of a repeated 
measures ANOVA reveal differences in the processing of the three cat- 
egories (compounds, derivations, simplex nouns). The results show that 
each of the three categories led to highly significant differences from each 
other: F(2,158) = 35.29, p < 0.01. Independently of whether the item 
was a word or a non-word, compounds were processed with significantly 
more accuracy than simplex nouns, whereas derived words resulted in an 
intermediary position, significantly different from both compounds and 
monomorphemic nouns (see Figure 5). 
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0.78 
0.76 


0.74 
Compound Derivation Simplex 


Figure 5. Mean values of accuracy in processing in compounds, 
derivations and simplex words 


When analysing our data (LDT 2, 3 and 4) with regard to familiar- 

ity by using general linear mixed-effects models (Bates et al. 2015), we 
found that very unfamiliar words are processed with significantly less 
accuracy in all three experiments, but that there were no significant differ- 
ences with regard to the presence vs. absence of a morpheme boundary or 
consonant cluster or both. Familiarity is shown to be a highly influential 
factor when it comes to compounds (LDT 2 and 4), whereas in LDT 3 
(derivations vs. simplex nouns), familiarity is an intervening variable to a 
lesser extent (see Table 6). 
It is a novel finding, as first presented by Sommer-Lolei et al. (2017), 
that familiarity is a more important variable in compounds than foreign- 
ness, which itself is more important than token frequency. The results 
for LDT 2 show that foreignness is much closer to frequency than to 
familiarity. Therefore, we can conclude that although all three variables 
are significant in respect of processing, familiarity is the most important 
factor when dealing with compounds, which is also a warning against 
relying excessively on frequency. Instead, other variables, in particular 
familiarity, should be considered as well. 

Thus, results point to the fact that it is important whether a given string 
is a citation form or not, and, additionally, whether the item contains a 
morpheme boundary is rather important, although we only found effects 
of morphonotactic consonant clusters in LDT 2 and 4, which demon- 
strates the strong impact of compounds. 

Overall, the presence of a morpheme boundary facilitates word recog- 
nition and processing except for unfamiliar and foreign words, regardless 
of whether there is a consonant cluster or not. 
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Table 6. Hierarchy of influencing factors on the accuracy of German stimulus words 


Accuracy of words in: 
z values and significance levels’ | -14.007*** -4.935*** sol tes 
1906.541 1996.92 2006.647 


z values and significance levels | -12.573*** 8.9017 ** -5.807*** 
AIC values | 3493.344 3529.927 3559.36 


4.2. INFLUENCE OF FREQUENCY, FAMILIARITY AND FOREIGNNESS 
ON LATENCY 


Reaction times were measured for LDT 2 and 3, but as already men- 
tioned above, this was impossible for LDT 4. The results demonstrate in 
both experiments that frequency has an impact in the sense that words 
with a high token frequency in the AMC corpus are processed significant- 
ly faster, regardless of whether the word contains a morpheme bound- 
ary and/or a consonant cluster or not. Thus, the positive influence of a 
morpheme boundary on accuracy has no correspondence in latency. In- 
terestingly, the interaction of morphology with phonological processing 
appears to increase accuracy, because accuracy is monitored on one ad- 
ditional level, whereas it neither slows down nor accelerates processing 
significantly. 

In terms of familiarity, we find that unknown or very unfamiliar words 
were processed significantly slower in both experiments. In LDT 2 (com- 


* The following significance levels were selected: * 0.05, ** 0.01, *** 0.001. Please 
note that the negative z values for the foreignness and familiarity rating are due to 
the coding. For foreignness, this is more intuitive: if a word was rated as being not a 
foreign word, this was coded as 1, whereas 2 indicated that it was rated as a foreign 
word. Therefore, as expected, the accuracy of the participants is higher if the word is 
less foreign. However, the familiarity rating may appear somewhat counterintuitive 
as it was, instead, an unfamiliarity rating, ranging from 1 (well known or familiar) 
to 5 (unknown or unfamiliar). This leads to the negative z values: the participants’ 
accuracy is higher if a word is less unfamiliar (i.e. more familiar). 

AIC refers to the Akaike Information Criterion (see e.g. Levshina 2015: 149), which 
was used as the primary criterion for model selection: the smaller the AIC value, the 
better the fit of the respective model. 


Ke] 
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pounds vs. simplex nouns), the analysis ofreaction times shows that words 
containing a morpheme boundary but no consonant cluster (M+P-) delay 
processing significantly, followed by words with a morpheme boundary 
and cluster (M+P+), which show a weaker effect on latency. In LDT 3 
(derivations), we did not find significant differences of this type. 

Regarding the degree of foreignness, the analysis reveals that highly 
foreign words are processed significantly slower in both experiments 
which is also the case for stimulus words containing a morpheme bound- 
ary (with or without consonant cluster). Analysis of LDT 2 (compound- 
ing) shows that words with a morpheme boundary that also contain a con- 
sonant cluster (M+P+) delay processing with high significance, followed 
by words with a morpheme boundary and without a consonant cluster 
(M+P-), which tend to be processed slightly faster. Interestingly we find 
the opposite picture when analysing data from our LDT 3, which means 
that words with a morpheme boundary but without a consonant cluster 
(M+P-) are processed significantly more slowly than the M+P+ items. 
This points to the fact that the presence of a morphonotactic consonant 
cluster delays processing in foreign compounds but only shows a weak 
effect on the latency of foreign derivatives. 

As summarized in Table 7, we conclude that with regard to the latency 
of compounding, familiarity is the major influencing factor, whereas it is 
frequency that plays a highly important role when processing derivations. 


Table 7. Hierarchy of influencing factors on the latency of German stimulus words 


Latency of words in: 


LDT 2 (Comp. vs. Simplex) Familiarity > 
z values and significance levels | 8.577*** 

AIC values | -5732.506 
LDT 3 (Deriv. vs. Simplex) Frequency > 


z values and significance levels | -12.340*** 8,3 70*** 63135A 
AIC values | -6361.651 -6324.792 -6308.46 


Morpheme boundaries tended to be helpful on the sublexical level, as 
found by Celata et al. (2015) in the split cluster task, where frequency 
also had an important impact (see 3.1). By contrast, our results on lexical 
processing in terms of latency show significant delays in the presence 
of morpheme boundaries in unfamiliar and/or foreign words, whereas 
morpheme boundaries were helpful in familiar and non-foreign words. 
Frequency was always helpful, but often much less so than familiarity. 
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5. CONCLUSION 


As for German, no effect of the morphonotactic character of conso- 
nant clusters is shown in inflection. Therefore, the Strong Morphonotactic 
Hypothesis is not supported for German inflection, neither in first lan- 
guage acquisition nor in adult or adolescent language processing. 

The Strong Morphonotactic Hypothesis could only be supported for 
German compounding, where the strongest facilitating effect was found 
for morpheme boundaries with consonant clusters. However, in deriva- 
tives too, positive effects of morpheme boundaries (with and without 
consonant clusters) on processing were found. Nevertheless, when com- 
pounds, derivatives and monomorphemic words were directly compared 
within the same participants, compounds showed significantly higher 
levels of accuracy than derivatives. This points to the second compound 
constituent being more readily identifiable, due to its coexistence as an 
autonomous lexical element, compared to the harder process of retriev- 
ing suffixes (inflectional or derivational suffixes). In processing, this is 
a consequence of the process of chunking elements (here phonemes and 
graphemes). Apparently morphological chunking is one of the normal 
processing strategies. Therefore, it is desirable to conduct further experi- 
ments in which morpheme and syllable chunking can be compared. 

Words are processed faster and significantly more accurately, the more 
familiar a stimulus word is (particularly in compounding) and, to a lesser 
extent, the more frequent it is (particularly in derivations). This greater 
effect of familiarity can be linked to Libben’s (2014) principle of oppor- 
tunity: in compounds, the familiarity of both the whole compound and of 
the word families of its constituents facilitates processing, whereas the fa- 
miliarity of the more abstract, 1.e. much less morphosemantically descrip- 
tive suffixes must have a much smaller influence. As a consequence, the 
frequency and productivity of suffixes has a relatively greater importance. 
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IV. Exploring phonotactic and morphonotactic 
constraints in the acquisition of consonant 
clusters in L1 French 


BARBARA KOPKE! 
OLIVIER NOCAUDIE! 
HELENE GIRAUDO? 


1. INTRODUCTION 


Consonant clusters are relatively rare in the languages of the world 
and count as phonologically marked (see e.g. Greenberg 1965; Clem- 
ents & Keyser 1983; Maddieson 1984; Vennemann 1988; Blevins 1995; 
Dziubalska-Kotaczyk 2009). Nevertheless, they do figure prominently in 
a number of languages such as German or Polish. The fact that most of 
these languages are inflecting-fusional ones and that many of the clusters 
attested in them occur at morpheme boundaries suggests that morpho- 
logical factors might be involved in the emergence and first language 
(L1) acquisition of clusters. In particular, the possibility that clusters may 
function as boundary signals (cf. already Trubetzkoy 1939) is likely to 
play a role. 

The present study focuses on the time of emergence, position and 
phonotactic vs. morphonotactic status of consonant clusters (henceforth 
CC) in the acquisition of L1 French. In order to benefit from the grow- 
ing number of resources made available by the scientific community, we 
selected a corpus out of the CHILDES database (MacWhinney 2000), 
allowing for comparison with L1 acquisition data collected to test the 
Strong Morphonotactic Hypothesis (SMH) for German (Freiberger 2007, 
2014). We then analysed longitudinal data from four children aged 1;6 
to 3;0 collected in spontaneous speech interactions between a parent and 
the target child with a generalized linear mixed model investigating the 
role of the factors age, position and phonotactic (PH) vs. morphonotactic 
(MPH) status on the successful pronunciation of the different CCs. 


' LNPL — Laboratoire de NeuroPsychoLinguistique, University of Toulouse (UT2), 
Toulouse, France. 
2 CNRS, CLLE-ERSS, University of Toulouse (UT2). 
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2. CONSONANT CLUSTERS IN FIRST LANGUAGE ACQUISITION 


2.1. PHONOTACTICS AND CONSONANT CLUSTERS 


A number of studies have investigated the role of phonotactics in the 
acquisition of the L1. It is generally assumed that phonotactics provide 
cues allowing the child to identify word boundaries in speech, hence con- 
ferring a key role to phonotactics in word segmentation. This is confirmed 
by growing evidence about how the acquisition of phonotactics boot- 
straps the acquisition of lexicon and grammar (cf. Boll-Avetisyan 2012). 
Moreover, phonotactics also play a role in adult language processing: for 
instance, McQueen (1998) showed that adults recognize words more rap- 
idly when the junction between two words forms a phoneme cluster that 
does not typically occur within words. Such findings draw attention to the 
role of CCs and their significance in phonotactics since their nature and 
distribution varies a lot across languages. 

The production of CCs has been shown to be particularly difficult in 
L1 acquisition, in accordance with the idea that more marked structures 
(Trubetzkoy 1939) will be more difficult to acquire than less marked 
ones. However, it has also been shown that the ease of acquisition of CCs 
is not homogenous and depends on a number of parameters such as syl- 
lable structure, position, frequency, morphology and input factors (e.g. 
Demuth & McCullough 2009), which also differ across languages. In the 
same vein, Levelt, Schiller and Levelt (2000) have shown that the order 
of acquisition of syllable structures “closely matched the frequency with 
which those syllable structures occurred in child-directed speech” (cited 
by Demuth & McCullough 2009: 427). 

Structural aspects of phonotactics have been comprehensively de- 
scribed in the Beats-and-Binding model of Dziubalska-Kotaczyk (2002), 
providing a scale of preference for markedness based on overall sonor- 
ity as composed of sonority, place of articulation and voicing. Studies 
by Demuth and collaborators on L1 acquisition also take into account 
sonority (e.g. Demuth & McCullough 2009). In addition, it has been sug- 
gested that syllable-initial consonants are less marked than syllable-final 
consonants and hence easier to acquire, the former also being present in 
a larger number of languages. The same should hold for CCs in an initial 
position, which should be easier to acquire than CCs in a final position. 
However, for analytic languages such as English, the contrary seems to be 
true, which has been explained through frequency and syllable structure 
effects, among other things (see Demuth 2007; or Demuth & McCullough 
2009, for reviews). Data for Dutch (where word-initial and word-final 
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clusters are about equally frequent) suggest that some children produce 
clusters more easily in the initial position and others in the final position. 

These data suggest a strong link between the acquisitional process and 
the structure of the target language. Demuth and McCullough (2009) have 
challenged these predictions with respect to French, based on the idea that 
analyses of child-directed speech show that around 70% of CCs are in the 
word-initial position, predicting a frequency advantage for word-initial 
clusters. This hypothesis is investigated with the analysis of the speech 
production of two children (Tim and Marie from the Lyon Corpus re- 
corded by Demuth & Tremblay 2008) recorded repeatedly between ages 
1;5 and 3;0. The results show higher accuracy in the production of word- 
initial CCs than word-final CCs as predicted, and this independently of 
syllable structure factors such as sonority. The study furthermore showed 
that word-medial CCs were produced with roughly the same accuracy as 
word-initial CCs in French. 


2.2. MORPHONOTACTICS AND CONSONANT CLUSTERS 


Apart from frequency and structurally based factors, morphological 
structure has also been supposed to play a role in the acquisition of CCs. 
For example, Kirk and Demuth (2005) examined the production of CCs 
in an elicitation task in English-speaking two-year-olds and found that the 
children were more accurate in producing word-final clusters consisting 
of obstruent +/s/ compared to word-initial clusters (e.g. cups vs. spoon) 
and, most interestingly, to final clusters with the same but reversed seg- 
mental content (e.g. cups vs. wasp). The authors explain these results 
through both input-related and morphological factors: obstruent +/s/ clus- 
ters are more frequent, but most importantly (and probably linked to the 
first factor), they involve an inflectional (and very productive) morpheme. 
This could lead to a morphological advantage for final clusters, which, in 
turn, may help children acquiring languages such as English to focus very 
early on complexity at the end of the words. Furthermore, morphological 
effects appear very early in language perception and production. Numer- 
ous studies suggest that infants start perceiving functional morphemes at 
an early age. They distinguish forms of function words (free morphemes) 
from content words and decode specific function words during the first 
year of life (e.g. Shi & Werker 2003; Shi, Werker & Cutler 2006; Shi, 
Werker & Morgan 1999). 

A recent study conducted by Marquis and Shi (2012) provided the 
first empirical evidence that French-learning 11-month-olds can use the 
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encoded bound morphemes for interpreting the internal units of newly 
encountered words. They demonstrated that infants analyse the word- 
internal morphology during their first year of life before learning word 
meaning, and that the decoding of a bound functional morpheme depends 
on the morpheme frequency. This finding suggests that rudimentary rep- 
resentations of morphological alternations emerge very early in the long- 
term memory of infants. Finally, in an application of the Beats-and-Bind- 
ing model to L1 acquisition, Zydorowicz (2007, 2009) has shown that 
Polish-speaking children reduce morphonotactic clusters less frequently 
than phonotactic ones, similarly to what has been shown for Lithuanian 
and English (Kamandulyté 2006; Kirk & Demuth 2005). 

Despite such promising results, investigations of how morphotactics 
facilitate the acquisition of phonotactics in children are still rare. Howev- 
er, Dressler and Dziubalska-Kolaczyk (2006) and Dressler, Dziubalska- 
Kotaczyk and Pestal (2010) have elaborated a model of morphonotactics 
and its correspondence to phonotactics that has been tested in preliminary 
studies on L1 acquisition of German (Freiberger 2014). The Strong Mor- 
phonotactic Hypothesis (SMH) issued within this framework assumes 
that typically developing children should acquire morphonotactic clusters 
earlier than comparable purely phonotactic ones, because morphonotactic 
clusters are likely to be expressed more consistently in the input to which 
the children are exposed. Moreover, their segmental constituents will also 
occur independently of each other in phonologically less marked con- 
figurations. The investigation of morphonotactic clusters acquired dur- 
ing different acquisitional phases of productive morphology (cf. Bittner, 
Dressler & Kilani-Schoch 2003) should allow us to establish which of 
the two factors plays a more important role. The framework also assumes 
that morphonotactic clusters with many phonotactic counterparts lack a 
morphological signalling function. Moreover, they may often be affected 
by the same repair mechanisms as parallel phonotactic clusters and ex- 
pressed less faithfully in the adult speech input than morphonotactic clus- 
ters with few or no phonotactic counterparts. Therefore, they also ought 
to be acquired less easily and during later phases. Nevertheless, adult-like 
production of morphonotactic clusters may precede (in terms of the first 
emergence or frequency of occurrence) the production of homophonous 
purely phonotactic clusters. 

Furthermore, phonologically more marked clusters should be acquired 
later than less marked ones (independently of whether they are phono- 
tactic or morphonotactic). Still, it needs to be investigated how the ac- 
quisition of morphonotactic clusters is affected by the absence or pres- 
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ence and also the frequency of parallel phonotactic ones. The framework 
also stresses that the measurement of markedness should differentiate 
between different positions in the word and give much consideration to 
ease of perception (such as the Net Auditory Distance among segments, 
as proposed by Dziubalska-Kotaczyk 2009). 

These assumptions have been investigated by Freiberger (2014) 
through the analysis of a longitudinal corpus of spontaneous speech data 
from 3 typically developing monolingual children acquiring German as 
an L1 in Austria that was analysed with respect to the interaction of pho- 
notactic and morphotactic factors. For each child, the researcher selected 
30 minutes of recordings per month between age 1;6 and 3;0°. Three de- 
velopmental phases were distinguished during this period. The data were 
transcribed according to CHILDES norms, all spontaneously produced 
clusters were extracted, and correctly vs. incorrectly pronounced clusters 
were analysed with respect to the number of consonants, position, mor- 
phonotactic vs. phonotactic status and number of morphological bound- 
aries. The results showed the expected progression in accuracy with age 
and that the children had more difficulties with initial than with medial 
and final clusters (similarly to what has been shown for English). While 
morphonotactic clusters did not involve additional difficulties due to their 
complexity, there was no interaction between morphological and phono- 
logical factors as expected. 

Given the specificities in the acquisition of morphonotactic CCs to be 
expected in different languages varying with respect to inflectional pat- 
terns and phonological typology, the present study seeks to complement 
existing data on the acquisition of CCs in Polish, German, English and 
Lithuanian with data from French that, as a Romance language, can be 
expected to show a different acquisitional path. While some data exist on 
phonotactic factors, previous studies on CC acquisition in French have 
not taken morphological factors into account. 


3. METHODOLOGY 


The aim of the present study was to explore the acquisition of CCs 
by L1 French children, their time of emergence and speed of acquisiti- 
on, as well as to scrutinize aspects of the SMH (Dressler & Dziubalska- 
Kotaczyk 2006) in L1 French acquisition. In order to provide data that 


3? One of the children was recorded from age 1;3, allowing the author to take into 
account a fourth developmental stage, TO, for this child. 
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can be compared to other related studies within this framework, the me- 
thodology chosen is based on a replication of Freiberger’s (2014) analysis 
of a corpus with data from 3 children aged 1;6 to 3;0. 


3.1. CORPUS SELECTION 


An increasing number of resources on L1 acquisition have been made 
available to the scientific community in recent years. For the present 
study, we referred to the CHILDES database (MacWhinney 2000) and 
selected a corpus with a comparable recording process to the method- 
ology used by Freiberger (2014) in order to test the SMH in L1 Ger- 
man acquisition. The criteria we used were the recording of spontane- 
ous speech interactions between a parent and the target child, an age of 
onset of 1;6 and regular recordings up to 3;0. The corpus recorded by 
Demuth and Tremblay (2008) (hereafter ‘Lyon Corpus’) meets these re- 
quirements: four children were, on average, recorded bimonthly (Anais, 
Marie, Nathan and Theotime) from an age onset of 1;0 to the age of 3;0 
(4;0 for Marie) and the children’s utterances have been orthographically 
and phonetically transcribed in CHAT format (MacWhinney 2000). The 
data of one of the children (Marie) had already been analysed with respect 
to the acquisition of CCs (together with data from Tim in Demuth & Mc- 
Cullough 2009). A further advantage of the Lyon Corpus is that data for 
child-directed speech are available from the mothers of the children. 


3.2. METHODOLOGY AND CHARACTERISTICS OF THE CORPUS 


The data analysed here are summarized in Table 1. We have analysed 
18 recordings per child for Anais, Marie and Theotime and 20 recordings 
for Nathan. The table shows that the number of word tokens is highly 
variable from one child to another — as can be expected in this age group — 
ranging from 6,396 tokens (Nathan) to 22,729 tokens (Anais). First of 
all, we analysed the orthographic transcription of the corpus. To start 
with, we performed a frequency count of each lexical item transcribed 
from these recordings and sorted all words containing CCs with the help 
of the FREQ command in CLAN (freq*.cha +t*CHI) followed by man- 
ual extraction of the targets. The following tokens were excluded from 
this analysis: some complex lexical units (parce que ‘because’)*, proper 


* In what follows, we provide English translations for the French lexical units, but not 
for proper names and interjections or onomatopoeia. 
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names (Amtaro) and onomatopoeia (vroum). The analysis shows that in- 
fant speech in French involves an interesting, albeit also variable propor- 
tion of words containing CCs ranging from 4.4% for Anais to 10.6% for 
Theotime). 


Table 1. Number of recordings and tokens per child, percentages of 
CC tokens in the corpus 


Recordings (age) | Totalnumber | CC (% of total) 


Child of word tokens 

Anais 1023 (4.50%) 
Marie 18 (1;0-4;0) 19,496 1642 (8.42%) 
Theotime 18 (1;0-3;0) 20,150 2140 (10.62%) 


Further analysis of the words produced by two of the children (Marie and 
Nathan) demonstrates the variety of CCs found in the speech of the chil- 
dren. Table 2 presents a summary of the cluster combinations according 
to consonant type of the first and the second consonant. Unsurprisingly, 
the table clearly shows that combinations of plosives and liquids are most 
frequent in infant speech (see also Demuth & McCullough 2009). The 
data also provide evidence for the presence of some more complex clus- 
ters involving three or four consonants (see Table 3). 


Table 2. Variety of CCs in Marie and Nathan‘s recordings 


C1 


Fricative /sf/ /{k/ /sk/ /sp/ /st/ /fl/ fr/ /zl/ /vr/ 
Plosive /ks/ /ps/ /tf/ /gb/ /kt/ /mt/ /pt/ /bl/ /br/ /dr/ /g\/ /gr/ / 
kl/ /kr/ /pl/ /pr/ /tr/ 
[s/ /rs/ £f) /rz// | Ad/ Ak/ Am/ /ln/ /At/ /rb/ /rd/ / | /rl/ 
rg) /rv/ rg/ /rk/ /rm/ /rn/ /rp/ /rt/ 
Table 3. CCs with 3 or more consonant sounds in Marie’s and 
Nathan’s recordings 


Liquid 


3+ CCs 
/kspl/ /kspr/ /lkr/ /rbr/ /rkl/ /rsk/ /str/ 


We also analysed the number of CCs for different developmental stages, 
similarly to the analyses by Freiberger (2014). The data summarized in 
Table 4 show a clear progression of the number of CCs with age for each 
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child. However, bear in mind that these data were obtained with the or- 
thographical tier and remain hypothetical with respect to the actual pro- 
duction of the CCs. As such, they mainly demonstrate the diversification 
and complexification of the lexicon in each child. 


Table 4. Number of CCs in the word tokens per child for each age group 


Child 1;6-1;8 1;9—1;11 2;0-2;2 2;3-2;5 2;6-2;8 2;9-2;11 3;0-3;2 
Anaïs 12 11 78 164 271 221 268 


Marie 12 82 200 161 290 525 376 
Nathan 6 31 20 126 69 130 82 
Théo al 213 163 399 239 648 428 


Qualitative analysis of the words (see Tables I and II in appendix) 
shows that the first words containing CCs produced by French L1 chil- 
dren are from various categories with an important proportion of nouns 
(e.g. nounours ‘teddy’, veste ‘jacket’, écharpe ‘scarf’, fleur ‘flower’, 
fraise ‘strawberry’) and interjections/onomatopoeia (e.g. bravo, oups, 
vroum, gling, clac, crac) followed by adverbs or adverbial locutions (e.g. 
s il-te-plait ‘please’, après ‘after’, autre ‘other’, plein ‘lots’, trop ‘too 
much’) and verbs (e.g. regarde ‘look’, prend ‘take’, parti ‘gone’, marche 
‘walk’, ferme ‘close’), mostly in infinitive or participle constructions. 

We then sorted words containing CCs that were classified as either 
phonotactic or morphonotactic in the productions of the four children 
(see Table 5). This was achieved with the FREQ command provided by 
CLAN. In addition, manual extraction of our targets allowed us to estab- 
lish the CC’s frequency in French and their phonemic variety in infant 
speech. Then, the CCs presented in Tables 2 and 3 were set up as tar- 
gets we systematically searched for within a subset of the Lyon Corpus. 
This subset is composed of around 20 hours of recording for each child 
and balanced across different time frames. Within this subcorpus, we 
compared for each target token the expected phonological form and the 
child’s actual pronunciation. Through KWAL commands followed by 
manual extraction, we sorted 5,276 occurrences which were categorized 
according to the following criteria: child, age, gender, CC group (Table 
2 headers), CC (Table 2 contents), CC position (initial, medial, final), PH 
vs. MPH (phonotactic vs. morphonotactic cluster), grammatical class, 
lexical form, number of syllables, phonetic realization, CC realization (0 
= error; 1 = success), and the type of CC error (reduction, substitution, 
omission, repetition, epenthesis, shifted cluster or mixed sounds). 
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Table 5. Total number of phonotactic (PH) and morphonotactic (MPH) 
clusters in each position for the four children 


initial medial final Total 
MPH 289 121 1,010 
PH 1,932 L297 1,030 4,259 
Total 1,932 1,586 1.751 5,269 


Table 5 shows that there are more than four times as many phonotac- 
tic clusters (N = 4,259) than morphonotactic clusters (N = 1,010) in the 
recordings. While PH clusters appear in all positions and even slightly 
more in the word-initial position, MPH clusters mostly appear in the final 
position and to a lesser extent in the middle of words, but not at all in 
the word-initial position. The latter finding is not surprising since word- 
initial MPH clusters are non-existent in French and consequently absent 
from child-directed speech (Demuth & McCullough 2009). 


Table 6. Distribution of the different types of MPH clusters in word-medial and word- 
final position 


Cluster Example Position 

MPH ' medial final eater 
Dr prendre ‘take’ 7 7 
Rd regarde’ ‘look’ 58 541 599 
Rk pourquoi ‘why’ 162 162 
Rm dormi ‘slept’ 14 1 15 
Rt parti ‘gone’ 52 8 60 
sf transforme 3 3 

‘transform’ 

Tr mettre ‘put 164 164 
Total 289 721 1,010 


Table 6 shows the number of tokens and distribution of the different 
MPH clusters in the corpus. It is obvious from the data that some of the 
clusters are very rare and appear in only one specific word. For example, 


> The data concerning these items have, however, to be treated with caution. First, the 
morphotactic status of the cluster is questionable since the different forms of the verb 
regarder ‘to look’ are principally opposed to the noun regard ‘gaze’ (rare in child 
language!), and the noun is derived from the verb, and not the other way around 
(Dressler, personal communication). Additionally, as we will see later, this cluster is 
very frequently reduced through omission of the liquid by the children, but also in 
child-directed speech (Demuth & McCullough 2009). 
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‘sf’ is limited to the word transforme ‘transform’ and only appears in this 
configuration in a word-medial position. Other clusters show high overall 
frequency, but again, they appear in only one very frequent token (pour- 
quoi ‘why’, regarde ‘look’). 


We then proceeded to the analysis of the phonological tier allowing us 
to establish the proportion of CCs correctly produced. Table 7 recapitu- 
lates the percentage of correctly produced clusters for each child in the 
different developmental stages. The high variability across the children 
is striking. While two of the children (Marie and Theotime) reach more 
than 80% correct performances by age 3, the proportion is only 40% 
for Nathan (and not yet stable), and it does not exceed 10% for Anais, 
for whom no real progression is evident over the recording periods. 
Furthermore, it cannot be excluded that the actual production of CCs 
is related to the use of words containing CCs, the two children obtain- 
ing the highest mastery of CCs also being those who produce the most 
words containing them (see Table 4). However, it also has to be noted 
that Anais, who shows clear difficulties with CCs, seems to be rather 
talkative, as indicated by the number of tokens produced (see Table 1). 


Table 7. Percentage of correctly produced consonant clusters per child and per age group 


Child 1;6-1;8 139-1311 2;0-2;2 23;3-2;5 2;6-2;8 2;9-2;11 3;0-3;2 


Anais 0 9.09 2.56 0.61 9.96 4.98 10.07 
Marie 0 10.98 9.00 34.78 57.24 81.14 80.05 
Nathan 0 3:23 10.00 3.17 14.49 14.62 40.24 


Theotime 7.84 41.31 61.35 70.68 81.97 87.96 84.11 


We then applied a generalized linear mixed model investigating the role 
of the following factors: age, position and PH vs. MPH status on the suc- 
cessful pronunciation of the different CCs. The results of this analysis are 
presented in the next section. 


3.3. STATISTICAL EXPLORATION OF THE FACTORS FAVOURING 
SUCCESSFUL PRONUNCIATION OF CCS 


For the statistical exploration, a generalized linear mixed model was 
chosen because our data consider a binary nominal variable (is a CC pro- 
duced correctly or not) in relationship with a list of factors, some of these 
being random (e.g. the child’s proficiency or the difficulty in pronouncing a 
CC) and others considered as independent variables (e.g. PH/MPH status). 
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With such a model, the slope of the logit is modelled over time (that is, the 
fixed effect on the probability that a CC is correctly produced), depending 
on the position of the CC, its PH or MPH status, the number of syllables in 
the word, and so on. We have defined two types of random effects: 


- the children (who are more or less performant [1 ]) 


- the CC types (which are more or less difficult to produce and 
more or less frequent [2]) 


Model 0 inspected the evolution of performance depending on the age 
of the children. The 4 children and all 10 CC classes (e.g. liquid+plosive 
or plosive+obstruent) were taken as random effects in the model. The 
results show that with growing age, the probability of a ‘correct’ answer 
increases significantly (B = 0.63, p = 0.001), as can be expected with typi- 
cally developing children. Figure 1 represents the probability slope that 
should be expected for any child, following Model 0’s data. 


Model 0 : CC correctly pronounced * age 


osoo ————— 


Figure 1. Percentage of consonant clusters cor- 
rectly pronounced per age group (Model 0) 


For Model 1, we added the morphonotactic status of the CC (PH vs. 
MPH) to the factor age and took into account the possible interactions 
between these factors. As for Model 0, an increase in age results in an 
increase of the probability that a CC is correctly produced (B = 0.71, 
p = 0.001). The model also suggests that the MPH status over all age 
groups had a positive effect on the pronunciation of CCs (B = 0.72, p 
= 0.02). However, the negative interaction between the factor age and 
MPH status tempers this result (B = -0.37, p = 0.001). To refine these 
first results, and as MPH CCs are not spread evenly across positions in 
the word, the following statistical models take into account the position 
of the CC in the word. 

Model 2 considered the CCs located in the word-initial position as 
well as the age groups. As CCs in this position cannot have MPH status, 
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the latter factor was excluded from the model. Model 2 reports a positive 
effect of increasing age (B = 0.59, p = 0.001) and the initial position of 
the CC (B = 0.75, p < 0.01) on the success of pronunciation of a CC. The 
interaction between these two factors also has a positive, albeit slight, ef- 
fect (B = 0.15, p = 0.001). 

Model 3 investigated CCs located in the word-medial position, age 
groups, and the CC’s MPH status. Again, rising age results in an increase 
in the probability that a CC will be pronounced correctly (B = 0.63, p = 
0.001). Nonetheless, when considered alone, the MPH status of the CC 
(B =-1.39, p = 0.001) and the medial position of CCs (B = -0.73, p < 0.01) 
seems to decrease the chance that a CC will be produced correctly. If we 
now consider the interactions in Model 3, there is a positive interaction 
between MPH status*medial position (B = 1.76, p = 0.001) with respect 
to correct pronunciation of the CC, while a less pronounced positive ef- 
fect is observed for the age*CC’s medial position interaction (B = 0.20, 
p < 0.01). 

Finally, Model 4 focused on the CCs in the word-final position, while 
also taking into account age and MPH status. The age factor remained, 
once again, the major effect (B = 0.80, p = 0.001) in the model. When 
isolated, factors such as the final position and MPH status had no signifi- 
cant effect in this model. However, the interactions age*final position (f 
= -0.35, p = 0.001) and MPH status*final position (B = -0.54, p = <0.05) 
showed a moderate negative effect on the probability that a CC will be 
produced correctly. 

To summarize our results, the statistical analysis showed that the fac- 
tor age was the most important influence on the children’s output, as can 
be expected from typically developing children. Concerning the CC’s po- 
sition in the word, French children tend overall to a left-side preference 
in the development of the pronunciation of CCs, with word-initial CCs 
tending to be produced correctly at an earlier age. As for the CC’s MPH 
status, our models observed that only MPH clusters in a medial position 
(and probably specifically for the latest age group considered here) had 
positive effects on the success of pronunciation of clusters. That being 
said, it must be kept in mind that these results may be influenced by sev- 
eral factors inherent to our corpus: 


- MPH clusters are relatively scarce in French words, and even 
more so in children’s spontaneous speech corpora; 


- MPH clusters occur only in a limited number of words, but these 
are likely to be very frequent French words, such as pourquoi 
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‘why’ with a cluster in the medial position, or regarde® ‘look’, 
in imperative mode, with a cluster in the final position. If re- 
garde is used as a deictic from very early on (543 tokens, from 
age 1;6 in the corpus) by both children and caregivers, pourquoi 
(162 tokens, from age 2;0 in the corpus) emerges later in the 
child lexicon, i.e. at a more advanced developmental stage. 


Nevertheless, the results also showed that there is a lot of variation 
among the children, which is why the next section seeks to explore fur- 
ther the individual developmental trajectories. 


3.4. EXPLORATION OF THE INDIVIDUAL DEVELOPMENTAL TRAJECTORIES 


The developmental trajectories for each of the four children were ex- 
plored with reference to the type of the given responses, first for all CCs 
and then specifically for MPH clusters. The results for overall CC produc- 
tion (see Figure 2) illustrate the inter-individual differences: while two 
of the children (Anais and Nathan) produce hardly any correct clusters 
during the entire period of investigation (1;6—3;0), the other two (Marie 
and Theotime) show a steady increase in correct production from age 2;0 
(Marie) and age 1;6 (Theotime) respectively. Additionally, error patterns 
are also interesting and demonstrate a specific trajectory for each child. 
While substitutions are rare in the production of all four, the children dif- 
fer considerably with respect to the amount and distribution of the other 
error types. Theotime is very performant from the beginning, and omis- 
sions of CCs are rare in his speech even at the first recording, and remain 
close to zero thereafter. By preference he uses reductions of CCs, but 
even these are exceeded by correct productions from as early as around 
age 1;8. Both Nathan and Marie start with 100% omissions at age 1;6, 
but these are caught up by reductions at around age 1;8 for Marie and 2;4 
for Nathan. Anais, who shows the most severe difficulties with CCs, has 
a quite different distribution of these two error types: from the beginning, 
she produces more reductions than omissions, and despite a lot of varia- 
tion in the different recordings, this does not change substantially over the 
period investigated. 

We then looked specifically at the distribution of these error types 
and the correct productions of CCs in MPH clusters only (see Figure 3). 
Anais’ CC productions remain close to zero during the whole investiga- 


é But see our reservations about the MPH status of the final cluster in regarde in note 3 
(section 2.2). 
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Figure 2. Percentage of occurrence of each type of CC event coded in the corpus for each 
child/age group. Correctly produced CC (blue), CC omission (red), CC reduction (green), 
CC substitution (purple). 
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Figure 3. Percentage of occurrence of each type of event coded in the corpus for MPH 
clusters only * child * age group. Correctly produced CC (blue), CC omission (red), 
CC reduction (green), CC substitution (purple) 
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tion period, while errors seem to be distributed randomly between re- 
ductions and omissions. In Nathan’s speech, correctly pronounced MPH 
CCs appear only at the end of the period, from age 2;9 onwards, and 
reductions start to exceed omissions from around age 2;5 onwards. Marie 
shows a similar pattern, but much earlier: correct productions appear at 
age 2;0 and the number of reductions exceed the omissions from around 
age 1;10. For Theotime the trajectory here is more similar to the pattern 
shown by Nathan and Marie, but even earlier than Marie and with higher 
success rates. Despite the high variability across children, these data sug- 
gest that there is an overall developmental pattern with the number of 
omissions decreasing while the number of reductions increases across the 
age groups. 


4. DISCUSSION 


The results of our data analysis are in line with the other studies on the 
acquisition of consonant clusters in French, showing that all these clus- 
ters are subject to different types of modifications (namely reduction to 
the least sonorous segment, gliding, insertion of schwa, etc.) before they 
are produced in a target-like manner. This has been documented, for ex- 
ample, in the study conducted by Andreassen (2013) on 13 monolingual 
children aged 2;2—3;2 years. The data are also consistent with the studies 
on CC acquisition in general (for a review, see McLeod, van Doorn & 
Reed 2001) showing that CCs emerge very progressively around age 2 
(e.g. French 1989; Lléo & Prinz 1996, for Spanish and German). Over- 
all, the studies on CC acquisition suggest that production of CCs starts 
around age 2, but, independently of the target language, their production 
is rarely correct at that age. Additionally, the age of mastery seems to be 
highly variable across children, as also documented by the data from the 
four children we considered in this study showing considerable variation 
in the correct CC productions of, for instance, Anais (10.07% correct at 
age 3;0—3;2) and Theotime (84.11% correct at the same age). With regard 
to American English, Shriberg and Kwiatkowski (1980) observed 90% 
correct at age 4, while Smit el al. (1990), on the contrary, report that only 
few productions were correct at age 4. In their sample the majority of CCs 
seemed to be mastered at age 6 or 7, though some difficulties persisted 
even up to age 8 or 9. 

Among the factors generally found to influence the age of mastery of 
pronunciation are the position and structure of the cluster. Yet the obser- 
vations made with respect to these factors are still inconsistent. For exam- 
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ple, McLeod and colleagues, in an extensive literature review on CC ac- 
quisition, state that “In languages other than English, word-final CC have 
been reported to be acquired before word-initial clusters” (McLeod et 
al. 2001: 101). However, as regards French, word-initial clusters tend to 
be produced earlier than word-final clusters (see Demuth & McCullough 
2009; Kirk & Demuth 2005). While it is also acknowledged that the po- 
sition no longer matters in older children, such inconsistencies between 
studies need to be elucidated. 

Additionally, McLeod et al. (2001) suggest that clusters involving 
plosive+liquid are easier to acquire than clusters involving fricative+liquid. 
This is in line with what has been observed by Demuth and colleagues 
(Demuth & McCullough 2009; Demuth & Tremblay 2008) for the acqui- 
sition of CCs in French. However, there are also a number of other, less 
investigated factors that are likely to play a role in language-specific ac- 
quisition trajectories: the syllable structure, foot structure, frequency (e.g. 
Kehoe et al. 2008), saliency (Baroni 2014) and the primary and secondary 
status of the cluster (Andreassen 2013). For instance, there is a strong ten- 
dency for French towards monosyllabism resulting in a reduced number 
of morphemes per words and hence in fewer morphological boundaries 
and reduced morphonotactics. 

Concerning our main research questions, the findings of the present 
study suggest that there are differences in the processing of MPH and PH 
clusters in the acquisition of L1 French. These are, however, modulated 
by the fact the MPH clusters mainly appear in the word-final position 
in French (and to a lesser extent in a medial position), a position that 
seems, for other reasons, to be less favourable in early acquisition. The 
observations we made are consistent with the predictions of the SMH in 
respect of the phonological and morphological characteristics of French 
(Dressler & Dziubalska-Kotaczyk 2006). From a phonological point of 
view, French is a largely vocalic language and, as such, has only a limited 
number of consonant clusters. On the morphological side, French is one 
of the weakest inflecting languages among the fusional languages (at least 
as far as oral French is concerned). Consequently, and as predicted by 
morphonotactics, the number of MPH clusters is rather low, at least in the 
vocabulary of the age groups investigated in the present study, and results 
have to be treated with caution. In order to counterbalance frequency ef- 
fects, it might be interesting to compare data from younger children ac- 
quiring languages with many MPH clusters with older children acquiring 
languages with few MPH clusters. 
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Another issue that needs to be discussed is the actual presence of the 
investigated clusters in child-directed speech to which these children are 
exposed. Recent studies on speech directed to French infants suggest that 
French parents tend to reduce omission of schwa when addressing their 
young children (e.g. Andreassen 2011; Liégeois, Saddour & Chabanal 
2015). Reduced omission of liquids in consonant clusters has also been 
reported for child-directed speech in connection with one of the children 
investigated in the present study in respect of word-initial clusters, where 
parents tend to produce more complete clusters when addressing their 
young children compared to their usage in adult-directed speech (Demuth 
& McCullough 2009). However, things seem to be more variable with 
regard to word-final clusters: while the results of the same study showed 
very low omission rates for /R/-plosive clusters (e.g. barbe ‘beard’ in our 
corpus — with the exception of regarde ‘look’, one of the most frequent 
productions in our corpus where omission rates are fairly high), omission 
rates were high for final plosive-/R/ clusters (75% for Marie’s mother, 
e.g. autre ‘other’). This suggests that the advantage observed for the ac- 
quisition of word-initial CCs is not only due to overall frequency but also 
to the fact that word-final clusters are often phonetically reduced in the 
input (see Demuth & McCullough 2009, for more details). 


5. CONCLUSION AND PERSPECTIVES 


Based on a longitudinal corpus analysis of natural data from four 
children, the present study provides preliminary data on the scope of 
the SMH (Dressler & Dziubalska-Kotaczyk 2006) as regards French. 
In accordance with the predictions of the SMH, our data show that the 
morphonological status of a CC does not per se facilitate its acquisition, 
which is largely modulated through other factors such as frequency and 
position in the word, at least in the initial developmental stages investi- 
gated in the present study. However, of course, the present investigation 
has only permitted a preliminary exploration of the issue of the interac- 
tion of phonological and morphological aspects in the development of 
CCs in French. An extension of the study to later developmental stages in 
older children, allowing us to weight the influence of frequency, is clearly 
one of the primary perspectives arising from the present investigation. It 
would also be interesting to look more closely into the processing cost 
with regard to the processing of inflected forms without clusters and to 
inspect error patterns related to PH/MPH status of CCs. 
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V. The natural perceptual salience of affixes 
is not incompatible with a central view of 
morphological processing 


HELENE GIRAUDO! 
KARLA ORIHUELA! 
BASILIO CALDERONE! 
BARBARA KOPKE? 


1. INTRODUCTION 


The Strong Morphonotactic Hypothesis (Dressler & Dziubalska- 
Kotaczyk 2006) assumes that phonotactics helps in the decomposition of 
words into morphemes. Accordingly, sequences occurring over a morpheme 
boundary (e.g. mords, mordent, mordre /mow/ /mogd/ /moxds/) correspond 
to a prototypical morphonotactic sequence that should be processed faster 
and more accurately than phonotactic sequences (e.g. ordre /ogdg/). 

While this issue is usually explored within studies on typical and atyp1- 
cal first language acquisition, a recent study carried out by Korecky-Kroll 
et al. (2014) tested this hypothesis with German-speaking adults. One 
experiment using a letter search task (1.e. find a letter like, for example, T 
at different positions — initial, medial, final — in a visual word like Taube 
‘dove’, dank-te ‘thanked’ and pack-t ‘(s/he) packs’) investigated whether 
sublexical letter sequences were found faster when the target sequence 
was separated from the word stem by a morphological boundary (e.g. 
pack-t) than when it was a part of a morphological root (e.g. Lift ‘lift’). 
The results showed that the presence of a morpheme boundary led to 
shorter reaction times (RTs) and fewer errors, whatever the target cluster 
position in the word. The authors concluded that phonotactics helps in 
the decomposition of words into morphemes without, however, explicitly 
considering that it is a direct consequence of a morphological decomposi- 
tion mechanism taking place during lexical access. 

More recently, Beyersmann et al. (2014) examined the morpho-ortho- 
graphic hypothesis according to which all complex forms are segment- 
ed into morphemes during lexical access. The prefixed and suffixed let- 


' CNRS, CLLE-ERSS, University of Toulouse (UT2). 
2 LNPL — Laboratoire de NeuroPsychoLinguistique, University of Toulouse (UT2). 


124 H. Giraudo, K. Orihuela, B. Calderone, B. K6pke 


ter strings they manipulated comprised real stems and affixes but never 
formed an existing word in French (e.g. propoint, filmure). They used a 
letter search task in which adult French speakers had to decide whether the 
target letter was present or absent in a string of letters (e.g. “R’ in propoint 
or filmure). The results revealed that the letter search took longer in suf- 
fixes compared with non-suffix endings but not for prefixes compared with 
non-prefix beginnings. Moreover, performance was not affected by letter 
cluster frequency. The difference in processing suffixes relative to non- 
suffixes was interpreted as reflecting a chunking/affix stripping mechanism 
(cf. Taft & Forster’s serial hypothesis 1975) that operates on functional 
units such as suffixes during lexical access. Furthermore, the authors inter- 
preted the absence of effects for the prefixed non-words (i.e. no significant 
difference between pre-fixed and non-prefixed non-words) in terms of the 
different semantic and syntactic functions of prefixes relative to suffixes. 

Giraudo and Grainger (2003) also found an asymmetry in the pro- 
cessing of prefixed vs. suffixed words in a series of masked priming ex- 
periments conducted with French complex words. More precisely, they 
found morphological masked priming effects, but only for prefixed words 
(i.e. enjeu ‘stake’ primed envol ‘flight’, while ennui ‘boredom’, a pseudo- 
complex in which en- is not a prefix and /apin ‘rabbit’, an unrelated word, 
did not induce any priming effect), suggesting that only prefix series are 
activated at the very early stages of visual word recognition. The authors 
interpreted this asymmetrical processing in terms of the different linguis- 
tic functions these two types of affixes imply, in particular the fact that 
prefixes usually carry more transparent semantic information than suffix- 
es, whose function is much more related to syntax. Accordingly, morpho- 
logical priming effects using affixed words would rely on the semantic 
relationships in prime-target pairs — prefix priming effects being clearly 
obtained. This result constitutes a challenge for decompositional models 
situating morphological effects at a sublexical level that should be insen- 
sitive to semantics. Moreover, the conclusions reached by Beyersmann et 
al. (2014) are not only contradictory to what Giraudo and Grainger (2003) 
found about affix processing in French but they are also incompatible 
with what Korecky-Kroll et al. (2014) found concerning the target cluster 
position, which did not interact with RTs in their experiment. Therefore, 
the conclusion about the kind of cognitive processes underlying affixed 
word analysis is not clear, while the interpretation of the effects obtained 
with a letter search task has to be re-examined. 

As a consequence, and because Beyersmann et al. (2014) tested non- 
words instead of existing words, meaning that they did not control the 
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morphonotactic characteristics of their material, it is worth carrying out 
a new experiment using words and controlled materials in order to tease 
apart the morphonotactic effects from the positional effects in word recog- 
nition. Working on non-words presents the advantage of creating the ma- 
terials very easily, especially when formal aspects have to be controlled. 
However, working on the morphological issue, the manipulation of non- 
words, even if they are morphologically complex, restricts the conclusions 
derived from the results. First, the morphological structure of a given word 
actually corresponds to both the form and meaning information, and sec- 
ond, this word is embedded in a network formed by its morphological 
family and series. Therefore, the manipulation of non-words suppresses 
the two morphological dimensions of complex words, their syntagmatic 
and paradigmatic structure (see Blevins 2014 for discussion). 

Finally, it is worth highlighting that, according to us, the letter search 
task does not permit examination of lexical access per se but rather mor- 
phological salience. Recently, Giraudo and Dal Maso (2016) discussed 
the issue of morphological processing through the notion of morpholog!- 
cal salience — defined as the relative role of the word and its parts — and 
its implications for theories and models of morphological processing. The 
issue of the relative prominence of the whole word and its morphological 
components has indeed been overshadowed by the fact that psycholin- 
guistic research has progressively focused on purely formal and surface 
features of words, drawing researchers’ attention away from what mor- 
phology really is, systematic mappings between form and meaning. While 
we do not deny that formal features can play a role in word processing, 
an account of the general mechanisms of lexical access also needs to con- 
sider the perceptual and functional salience of lexical and morphologi- 
cal items. Consequently, if we acknowledge the sensitivity to the word’s 
morphological structure, we claim that it corresponds to secondary and 
derivative units of description/analysis. 

In the present research, a letter search task was carried out using 
French words; the main comparison is between the words that include 
the target letter after a morphonotactic boundary (e.g. vivre ‘to live’) and 
those with a purely phonotactic one (e.g. centre ‘centre’). The hypothesis 
is that morphonotactic segmentation will be facilitated because of a dou- 
ble salience conveyed in the boundaries, as it is not only phonological but 
also morphological. Position effects will also be explored, as we compare 
the initial (position 1) vs. the final position (position 2) of letter targets. 
The results are presented with both a categorical analysis (comparing the 
different conditions) and a linear mixed-effects model. 
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2. METHODOLOGY 


2.1. PARTICIPANTS 


Thirty participants were recruited at the University of Toulouse, all 
native French speakers. The age range spanned from 18 to 30 years old 
(average 23); 7 were male and 23 were female. All participants were 
right-handed with normal or corrected to normal vision. 


2.2. ITEMS 


Sixty French word forms (the same inflectional form) were used as 
targets; those used for the critical condition can be segmented morphot- 
actically, while for the 3 other control conditions the syllabic segmenta- 
tion was purely phonotactic (phonology and morphology did not corre- 
spond). All words included a combination of the letters ‘RE’, half of the 
time at the beginning of the word and half of the time at the end. Four 
conditions were created in order to counterbalance the position (initial— 
final), the boundary type (morphological and phonological or only pho- 
nological) of the bigram to be searched, and the part of speech status 
(verb or noun in singular and infinitive) of the word (see Table 1 for 
stimuli characteristics). 

15 verbs morphonotactic (MP verb _RE/P2): verbs that had the let- 
ters ‘RE’ in a final position (after a consonant at the end of the word), 
and corresponded to a morphological and phonological boundary. For 
example, vivre ‘to live’ contains viv- as a morphological base (stem) and 
re- as a Suffix (the infinitive mark). 

15 nouns phono final (P noun_RE/P2): nouns that ended with ‘RE’, 
but the phonotactic boundary did not correspond to the morphological one. 
For example, in centre ‘centre’, -re is not a suffix and cent- is not a stem. 

15 verbs phono initial (P verb_ RE/P1): verbs that started with ‘RE’, but 
the phonotactic boundary did not correspond to the morphological one. For 
example, in refuse ‘refuse’ re- is not a prefix and -fuse is not a stem. 

15 nouns phono initial (P noun_RE/P1): nouns that started with 
‘RE’, but the phonotactic boundary did not correspond to the morpho- 
logical one. For example, in religion ‘religion’ re- is not a prefix and 
-ligion 1s not a stem. 

In order to perform the task, 60 filler words (30 nouns and 30 verbs) 
matched in length and frequency with the target words, but not having 
‘RE’ clusters in their spelling, were used. All word frequencies were tak- 
en from the Lexique database (New et al. 2001). 
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Table 1. Stimuli controlled for frequency and length 


Condition Length 
Morphono (verb_P2) 6.13 
vivre 


Phono (noun_P2) 32.04 6.20 


centre 


Phono (verb_P1) 30.12 6.33 
refuse 


Phono (verb_P1) 15.89 6.20 
religion 


2.3. PROCEDURE 


Participants were seated 50 cm from the computer screen and asked 
to perform a ‘letter search task’. They were instructed to respond as rap- 
idly and accurately as possible to whether the cluster of letters ‘RE’ was 
present or not in the word to be displayed on the screen. Participants re- 
sponded ‘yes’ by pressing one of two response buttons with the forefinger 
of their right hand and ‘no’ by pressing the other response button with the 
forefinger of the left hand. 

The DMDX software (Forster & Forster 2003) was used. Each trial 
consisted of the following sequence of stimuli: the letters to be searched 
(RE) presented in uppercase (for 700 ms), followed by a fixation mark 
(1000 ms), a French word in lowercase (50 ms), which, in turn, was re- 
placed by a mask (##########) that remained on the screen until the 
participant responded (for a maximum of 1500 ms). After 10 practice 
trials, participants received the 120 experimental trials in one block in a 
randomized order (see Figure | for a trial example). 


1000ms 


1500ms 


HHHHHHH 


Figure 1. Trial example for the letters search task (RE) 


> Frequency (Fx) calculated in frequency per million from the Lexique corpus database 
(New et al. 2001). Length corresponds to the average number of letters. 
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3. RESULTS 1 


For the statistical analyses, filler words (with no RE clusters) are not 
considered. Accuracy across all participants was above 80% but no sig- 
nificant differences were found (see Table 2 for accuracy and reaction 
time means). For the reaction time (RT), trials considered ‘errors’ were 
not taken into account (6% of the data), and a trimming procedure was 
used: excluding responses under 300 ms (1.6% of the data) and 2.5 SDs 
above or below the mean response time of each participant (2.89% of 
the data). For the RTs (see Figure 2 for RT means), an ANOVA was con- 
ducted using participants (F1) as a random factor, treating the boundary 
type as a within-participant factor (repeated measures). 


Table 2. Accuracy (in error rates) and reaction times (in milliseconds) means 


RT MP(verb_P2) | P(noun_P2) P(noun P1) P(verb_P1) 
Mean 606.20 664.38 573.55 607.18 
SD 97.20 128.86 102.10 110.63 


ACC MP(verb_P2) | P(noun_P2) P(noun P1) P(verb_P1) 
Mean 4.67 7.56 5.11 5:33 
SD 6.35 8.53 5.45 5.37 


700 
680 
660 
640 
620 

£ 600 


580 
560 
540 
520 
500 


MP(verb_P2) P(noun_P2) P(noun_P1 


* 
) P(verb_P1) 


** indicates a significant difference, * indicates a trend to significance 


Figure 2. Reaction time results (ms) with standard error bars (SE) 
of the mean for each of the different conditions 


A main effect of position was found, and planned comparisons found 
that the only significant difference among the four conditions was that the 
response times for the ‘MP’ condition were significantly faster F(3,59) = 
8.72 p < .01 than those obtained for the ‘P’ with RE in the first position 
(beginning). 
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Focusing on the comparison between the two conditions where the tar- 
get letters were at the beginning of the word and the only difference was 
the boundary type, the MP condition produced a facilitation effect (over 
the P one). To disentangle these findings and explore whether the effects 
are due to the fact that these conditions differ in grammatical category 
(part of speech), a comparison between these variables was undertaken, 
but it showed no significant difference. 


3.1. POSITION 


An ANOVA revealed a significant difference between the initial and 
final position: F(1,59) = 9.92, p < .01, indicating that participants’ re- 
sponses are faster (by 76.33ms) when the target letters (‘RE’) are in an 
initial position, taking together conditions P(noun P1) and P(verb_ P1), 
versus when they are in a final one, MP(verb_P2) and P(noun_P2). 

The average RT for the compiled values obtained for the conditions 
with ‘RE’ at the end or at the beginning of the word are shown in Table 3 
and Figure 3. 


Table 3. Mean average RT (and standard deviation, SD) 
of position (initial vs. final) in milliseconds 


Initial (P1) 587.3 (133.66) 
Final (P2) 633.63 (160.66) 


Net Effect (P1-P2) | 76.33 


kk 
640 
620 
£ 600 
580 
560 
540 
520 
500 
P1 P2 


** indicates a significant difference (p < .05) 


Figure 3. Mean average RT of position 
(initial vs. final) in milliseconds with SE bars 
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Looking only at the ‘position’ effect, it can be argued that ‘RE’ is iden- 
tified faster (regardless of whether the boundary is morpho-phonological 
or purely phonological) when it is at the beginning of the word; this may 
be due to the fact that we follow a left-to-right reading direction. Never- 
theless, in order to disentangle the findings, when comparing morphono- 
tactic to merely phonotactic boundaries, there is a significant effect for 
the morphonotactic condition. If position is the only factor, manipulation 
or variable considered, then ‘RE’ is always found faster when at the be- 
ginning of the word but, within this position, ‘RE’ is found significantly 
faster when the boundary is morphonotactic. This is probably due to the 
dual information conveyed, which enhanced the morphological salience, 
aiding the analysis of the word into its constituents and facilitating iden- 
tification of the target letters. 


3.2. PART OF SPEECH (GRAMMATICAL CATEGORY) 


The average RT for the compiled values obtained for the conditions 
containing ‘RE’ in verbs or nouns are shown in Table 4. An ANOVA re- 
vealed no significant difference between them F(1,59) = 1.93, p = .17, 
indicating that even if participants’ responses are slower (by 16.54 ms) 
when the target letters (‘RE’) are in a verb (condition MP verbRE and 
P_RE verb) than when they are in a noun (P_nounRe and P_RE noun), 
this numerical difference is not statistically significant. The average RT 
for the compiled values obtained for the conditions containing ‘RE’ in 
verbs or nouns are shown in Table 4 and Figure 4. 


Table 4. Mean average RT per part of speech 
(verb vs. noun) in milliseconds 


PART OF SPEECH | RT (SD) 
Verb 602.19 (136.51) 


Noun 618.73 (161.22) 
Net Effect (V-N) -16.54 
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660 
640 
620 

£ 600 
580 


560 


540 verb noun 


Figure 4. Mean average RT per part of speech (verb vs. noun) 
in milliseconds with SE bars 


3.3 MORPHONOTACTIC VS. PHONOTACTIC 


In the comparison of average RTs between the two conditions where 
the target letters were at the beginning of the word (P1) and the only 
difference was the boundary type, a facilitation effect was found for the 
morphonotactic boundary type (compared to the purely phonotactic). Fig- 
ure 5 shows the average RT and significant difference between the type of 
boundary (MP or purely phonotactic). 


700 
680 
660 
640 
620 

£ 600 
580 
560 
540 
520 


500 MP 


Pnoun_/c/+re 


** indicates a significant difference (p < .05) 


Figure 5. Reaction time results in milliseconds with SE bars of the 
mean for morphonotactic vs. phonotactic P2 conditions 
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4. RESULTS 2 


It is also interesting to examine the possible correlation between the 
RT and a set of variables related to all formal (surface) aspects of the 
words, such as the length in terms of characters, the number of syllables 
and the orthographic neighbourhood. The reaction times (log) of correct 
responses were also analysed using a linear mixed-effects model. The 
fixed factor predictors included are the following: 

a) condition (MP verb_RE/P2, P noun _RE/P2, P verb _RE/P1, P noun 
RE/P1) 

b) ortho_neigh (number of orthographic neighbours) 

c) nbsyll (number of syllables), 

d) nbletters (number of letters or word length) 

e) TP_ORT (orthographic transitions probability) 

f) FrWaC_ freq (the log frequency of the form in FrWAC corpus, Bar- 
oni et al. 2009). 


Table 5. Linear mixed-effects results of the reaction time data 


D 
participant 


(Intercept) |0.0317204 |0.1781 
Residual 0.0582263 | 0.2413 a 
Predictor Estimate Std. Error df t 
rte eae [|| — 


P(noun_RE) 
ill ill 
P(RE_noun) 
P(RE_verb) 


orto neigh [00017 [oos asss [user [ovo O 
osya [ooo [0.022981 [5092 (0205 [osn | 


nbletters 0.019177 [0.010635 [49.74 |1.803 [0.0774 J. 
TP_ORT -0.160263 |0.180087 [46.15  |-0.89 [0.3781 
log(FrWaC__|-0.00961 [0.0038 20.54 |-2.529 |0.0197 
freq) 


Signif. codes: 0 ****? 0.001 ‘**? 0.01 **? 0.05 <° 0.1 57 1 


Random 
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Participants and items were included as random effects. In particular, 
the TP_ORT variable reports the transitional probability of bigrams cor- 
responding to the last letter of the target word (e.g. <v> in viV-re and the 
next subsequent letter of the letters to be searched (e.g. viVRe, that is the 
conditioned probability that an <r> follows a letter <v>). Table 5 shows 
the results obtained from the linear mixed-effects model and Figure 6 
shows the significance of standardized fixed effects. 

A significant effect of boundary type is found, showing that partici- 
pants identify the letters within words that included morphological and 
phonotactic boundaries faster than in words with a purely phonotactic one 
(in P2). The intercept (morphonological condition: MP verb_RE/P2, our 
‘base case’) was significant, meaning that the RTs (for finding the letter 
‘RE’ in this condition) seem to reflect a facilitation effect, while the pho- 
nological condition (P noun _RE/P2) shows an inhibitory effect (longer 
RTs are needed to respond). In both of these conditions, the target letters 
are in the final position of the word (P2). 


log(FrWaC_freq) - i 


i 
1 
TP_ORT- ite 
1 
nbletters ~- a o 
\ 
‘ 
nasya 4 Ee ea 
1 
i 
1 
ortho_neigh - — hp 
1 
1 


condition (P(RE_verb)) = ee ee ee et 
condition (P(RE_noun)) - i 
condition (P(noun_RE)) - ie ____ 
-01 o0 01 
Figure 6. Standardized fixed effects showing that frequency, 
length and condition are significant 


The model predictors are the following (see Figure 7): 

a) Condition: Reaction times (log) during the letter search task for 
each of the boundary-type conditions (1) MP verb RE/P2, (2) P 
noun RE/P2, (3) P verb _RE/P1 and (4) P noun RE/P1. 

b) Frequency (FrWac_freq): The higher the frequency of the word 
form (same inflectional form), the lower the reaction times re- 
sponses associated with it. 

c) Length (nbletters): Contrary to frequency (and as expected), as 
the length of the word increases, the RT also increases. 
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d) Number of Syllables (nbsyll): Follows the trend for the length of 
the word, but the effect is not significant. 

e) Orthographic Neighbours (ortho_neigh): The more orthographic 
neighbours a word form has, the trend for the reaction times is to 
decrease, but this is not significant. 

f) Transition Probability (TP_ORT): No significant effect was 


found. 
condition FrWaC_freq 
6.375 - 
6,350 - 
6.325 - 
6.300 - fo 
6.275- 
6.250- + i + r 1 ‘ : ' 
1 2 3 4 100 200 300 400 
nbletters nbeyil 
6.375 
6.350 - 
3 6.325 - 
8 6.300- {ed 
6.275- 
6.250- . ' 1 ' ' 1 ' ' ' ' 
4 5 6 7 6 10 15 2.0 2.5 3.0 
ortho_neigh TP_ORT 


6.375- 
6.350 - 
6.325 - 


e300: Taaa = 


6.275- 
6.250- , 


Figure 7. Marginal effects of model predictors 


1. CONCLUSION 


The Strong Morphonotactic Hypothesis (Dressler & Dziubalska- 
Kołaczyk 2006) was tested following the letter search task paradigm using 
words in French with morphonotactic and phonotactic boundaries with 
different positions for the targets across the materials. The target could be 
either at the beginning of the word (position 1) or at the end (position 2). 
Globally, the results showed that prototypical morphonotactic sequences 
were processed faster than phonotactic sequences, suggesting that pho- 
notactics help us perceive the internal word structure in terms of mor- 
phological construction by enhancing their morphological salience. The 
presented results revealed that this was the case for position 2 (but not 
position 1): letter search times were longer when the target letters were 
embedded in a phonotactic condition compared to a morphonotactic one. 

Our findings also provide indirect evidence for the left-to-right bias 
in word-recognition processing asymmetry across word beginnings and 
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ends, and we assume that the mechanisms underlying printed word recog- 
nition are shaped by the physical constraints imposed by the reading di- 
rection (Giraudo & Grainger 2003). Responses were indeed significantly 
longer for the items that had the target letters in the second position (P2) 
compared to those in the first position (P1). 

A significant effect of frequency was also obtained, showing that the 
more frequent a word 1s, the faster the reaction times are, while all other 
variables (like orthographic neighbours, transition probability and gram- 
matical category) were found to be not statistically significant. According 
to our view of morphological processing (Giraudo & Voga 2014; Voga 
& Giraudo 2017), morphology plays a central role in the cognitive sys- 
tem at two levels: at a perceptive/surface level (when the morphological 
structure is salient, as is the case for morphonotactic words), and at a 
central level (where paradigmatic relationships organize the word repre- 
sentations coded in the mental lexicon). We claim that finding sensitivity 
to morphology and effects of abilities is compatible with a paradigmatic/ 
construction view of morphology (e.g. Booij 2010). On the one hand, 
morphological salience can speed up lexical access in adult word compre- 
hension and help to develop the morphological awareness of those learn- 
ing to read. Morphological awareness refers to children’s “conscious 
awareness of the morphemic structure of words and their ability to reflect 
on and manipulate that structure” (Carlisle 1995: 194). Accordingly, it 
contributes to reading ability (e.g. Brittain 1970; Carlisle 1995; Deacon 
& Kirby 2004; Mahony, Singson & Mann 2000; Nagy, Berninger & Ab- 
bott 2006; Nunes & Bryant 2006; Kirby et al. 2012). Consequently, mor- 
phemes can provide cues for meaning, spelling and pronunciation (e.g. 
Carlisle 2003). On the other hand, construction representations link mor- 
phologically related words at a central level, and the presence or absence 
of connections is determined by the degree of semantic/functional rela- 
tionships between the word forms according to their shared morpheme 
(base or affix). A fundamental assumption of this view is that construc- 
tion representations are created/emerge and are stabilized in long-term 
memory according to an ecological rule that imposes family and series 
clustering as an organizational principle of the mental lexicon. To con- 
clude, the claim is that the mental lexicon is constructed according to 
multiple dimensions: the perceptive salience of the word’s morphological 
structure (enhanced by morphonotactics) and its formal-semantic rela- 
tionships with the other coded words, in other words, its syntagmatic and 
paradigmatic dimensions. 
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